Dynamically adjust instances and/or dynamic batching vs HPA #5954

okyspace · 2023-06-17T04:27:12Z

okyspace
Jun 17, 2023

I believe HPA helps to scale at triton server level to allow more resources to handle ALL deployed models.

At model level, instances and dynamic batching might help. However I cannot find anywhere to dynamically config these to meet dynamic model level performance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dynamically adjust instances and/or dynamic batching vs HPA #5954

{{title}}

Replies: 0 comments

Select a reply

Dynamically adjust instances and/or dynamic batching vs HPA #5954

okyspace Jun 17, 2023

Replies: 0 comments

okyspace
Jun 17, 2023