Better doc for different between timeout and client_timeout of grpc_client.infer #7369

ShuaiShao93 · 2024-06-24T23:24:48Z

Is your feature request related to a problem? Please describe.
In the doc, it says for timeout, it's only only respected by the model that is configured with dynamic batching, and the server can take model-specific actions. But it's not clear what model-specific actions are. Does it terminate the request for all models? How is this different from client_timeout?

Describe the solution you'd like
Update the doc with the details below:

What are model-specific actions?
Does client_timeout also terminate all requests?
Why is timeout only respected by the model that is configured with dynamic batching?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better doc for different between timeout and client_timeout of grpc_client.infer #7369

Better doc for different between timeout and client_timeout of grpc_client.infer #7369

ShuaiShao93 commented Jun 24, 2024

Better doc for different between timeout and client_timeout of grpc_client.infer #7369

Better doc for different between timeout and client_timeout of grpc_client.infer #7369

Comments

ShuaiShao93 commented Jun 24, 2024