Layer to support heterogeneous inference #19653
Unanswered
wilderfield asked this question in Other Q&A
Replies: 0 comments
Does ONNX Runtime support the idea of running inference of a model heterogeneously? For instance, I've heard CoreML does.
Something like:
Evaluate the computational graph of a model, the data types used, and the operations required, then determine the most efficient way to execute the model across the CPU, GPU, and third-party accelerators, taking power efficiency, computational speed, and memory usage into account. Dynamically choosing the best processing unit for a given task, without explicit instructions from the developer, would be a significant advantage, particularly for applications where performance and power efficiency are critical.
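For reference, the closest existing mechanism is ONNX Runtime's execution-provider (EP) list: the developer names providers in priority order, the runtime partitions the graph among them, and any node a provider cannot handle falls back to the next one (ultimately the CPU EP). This is a minimal sketch of that mechanism, not the fully automatic, power-aware scheduling described above; the model path and input shape are hypothetical, and the CUDA EP assumes an onnxruntime-gpu build.

```python
import numpy as np
import onnxruntime as ort

# Providers are tried in priority order; nodes unsupported by one provider
# fall through to the next, with CPUExecutionProvider as the final fallback.
providers = [
    "CUDAExecutionProvider",  # GPU, if available in this build
    "CPUExecutionProvider",   # always available
]

# "model.onnx" is a placeholder path for illustration.
session = ort.InferenceSession("model.onnx", providers=providers)

# Inspect which providers the session actually registered.
print(session.get_providers())

# Run inference as usual; per-node placement across providers is handled internally.
input_name = session.get_inputs()[0].name
dummy_input = np.random.rand(1, 3, 224, 224).astype(np.float32)  # hypothetical shape
outputs = session.run(None, {input_name: dummy_input})
```

The partitioning here is static per session and driven by the developer-supplied priority list, rather than chosen dynamically by the runtime based on power or memory considerations.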