Profile/pressure analisys best practices #34

bhack · 2023-06-01T20:27:50Z

Can you add a clear list of best practices to test the sidecar resource allocation? From the current documentation it seems to be a little bit "black magic".
It is hard to understand on real workload if the sidecar is under pressure and it requires additional memory and CPU limits.

bhack · 2023-06-04T21:32:52Z

See also #35

songjiaxun · 2023-06-09T17:51:21Z

Thanks for the question. This is a great suggestion. Our team will discuss how to provide best practices guidance and update you here.

bhack · 2023-11-24T16:34:42Z

@songjiaxun Do you think that this could be solved by your comment at #61 (comment) ?

bhack · 2024-01-26T20:53:04Z

@songjiaxun As an alternative to unlimited resources do you think that we could occasionally log the the CPU occupancy so that we could fine-tune resource reservation?

bhack · 2024-04-19T20:20:25Z

As on autopilot we need to limit the CPU resources assigned to the sidecar how we could be sure that the sidecar is not a bottleneck?
On deep learning jobs we have dataloaders/dataworkers that require assigned CPU Resource on the main container. These need to cooperated with the sidecar resources to transfer file, preprocessing and feed GPUs.

We need to have a reliable way to understand when the sidecar is the bottleneck or instead it is on the dataloaders/datowrkers assigned resources.

bhack mentioned this issue Jun 1, 2023

Profile/pressure analisys best practices GoogleCloudPlatform/gcsfuse#1159

Closed

songjiaxun added the documentation Improvements or additions to documentation label Jul 14, 2023

songjiaxun mentioned this issue Feb 22, 2024

On Autopilot clusters, you cannot upload files larger than 10Gi #21

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Profile/pressure analisys best practices #34

Profile/pressure analisys best practices #34

bhack commented Jun 1, 2023

bhack commented Jun 4, 2023

songjiaxun commented Jun 9, 2023

bhack commented Nov 24, 2023

bhack commented Jan 26, 2024

bhack commented Apr 19, 2024

Profile/pressure analisys best practices #34

Profile/pressure analisys best practices #34

Comments

bhack commented Jun 1, 2023

bhack commented Jun 4, 2023

songjiaxun commented Jun 9, 2023

bhack commented Nov 24, 2023

bhack commented Jan 26, 2024

bhack commented Apr 19, 2024