-
Notifications
You must be signed in to change notification settings - Fork 280
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Managing Compute Resources for Containers #747
Comments
Hi, @jeffjunzhang .
I will try to add a new feature to manage k8s resources to Clipper. Thanks for your good point. |
Thank you, @withsmilo After the modification, we should re-compile all the source codes from scratch, right? |
@jeffjunzhang , |
@withsmilo Thanks again. Basically, say if I have 1 pytorch model to deploy with Clipper, I want to limit the model container with 10 cores+ some memory, and also 10 cores for the query front-end container and management container. What's the simplest way to achieve these? |
@jeffjunzhang , |
(1) In clipper, when we deploy a model container, e.g., a trained pytorch model, how do we set the CPU and memory limits for it? Is this supported by Clipper admin? If not, any solution to achieve this goal?
(2) Even when we start the clipper cluster, can we set a CPU/memory limit on the whole cluster? Also, is there a way to set the CPU/memory limits for query_frontend and mange_frontend as well?
Thank you for your help!
The text was updated successfully, but these errors were encountered: