Triton inference server and inference on different machines. #5769
Unanswered
prasad-nair asked this question in Q&A
Replies: 2 comments
-
@prasad-nair You might need a container orchestration setup such as Kubernetes, where Triton instances run on different devices as worker nodes managed by the Kubernetes control plane. Other approaches would add latency to inference time and so might not be useful (though I'd like to hear more about them).
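In a Kubernetes setup, a Service or load balancer would normally spread requests across the Triton pods. As a minimal sketch of the same idea done client-side, the hypothetical helper below round-robins over several Triton endpoints (the hostnames and the `TritonEndpointRouter` name are illustrative, not part of Triton itself):

```python
from itertools import cycle


class TritonEndpointRouter:
    """Round-robin router over several Triton server instances.

    Hypothetical helper: in Kubernetes this job is done by a Service
    or load balancer, but the routing idea is the same.
    """

    def __init__(self, endpoints):
        # cycle() yields the endpoints in order, forever.
        self._endpoints = cycle(endpoints)

    def next_endpoint(self):
        return next(self._endpoints)


router = TritonEndpointRouter(["node-a:8000", "node-b:8000"])
print(router.next_endpoint())  # node-a:8000
print(router.next_endpoint())  # node-b:8000
# Each endpoint could then be passed to e.g.
# tritonclient.http.InferenceServerClient(url=...) to send the request.
```

This keeps routing logic in the client; the trade-off is that the client must know every endpoint, whereas an orchestrator hides them behind one address.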
-
I recall that you can add more than one model repository to the same Triton server. With this, if you host the models on another device and can mount their storage as a repository in Triton, it might be possible.
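As a deployment sketch of this idea: Triton's `tritonserver` binary accepts the `--model-repository` flag more than once, so a remotely hosted repository can be mounted and added alongside a local one. The hostnames and paths below are placeholders, and NFS is just one way to do the mount:

```shell
# Mount a model repository exported by another machine
# (remote-host and all paths here are placeholders).
sudo mount -t nfs remote-host:/srv/models /mnt/remote_models

# Point a single Triton instance at both repositories;
# --model-repository may be repeated.
tritonserver \
  --model-repository=/opt/local_models \
  --model-repository=/mnt/remote_models
```

Note that this only moves model *storage* to another device: the inference compute still runs on the machine hosting the Triton instance, which may or may not be what the question is after.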
-
Is it possible to run Triton on one device and run inference on models hosted on other devices?