🤖 Real time inference with GPMA #65
What is real-time inference?

First, a deep neural network (DNN) model designed for the problem domain and the available data is trained, usually on a GPU or a high-performance CPU cluster, for anywhere from tens of hours to a few weeks. It is then deployed into a production environment, where it takes in a continuous stream of input data and runs inference in real time, yielding output that is either used directly as the end result or fed into downstream systems. Either way, applications with ever stricter latency requirements, driverless cars and search engines for instance, demand lightning-fast deep learning inference, usually within tens of milliseconds per sample. Thus, beyond academia's typical focus on faster training, industry is often more concerned with faster inference, which has put inference acceleration at the core of many hardware and software solutions, including the emerging class of cloud services known as MLaaS (Machine Learning as a Service). (See: "Difference between Deep Learning Training and Inference".)
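As a rough illustration of this pattern, here is a minimal Python sketch of a streaming inference loop with a per-sample latency check. The PyTorch toy model, the 128-dimensional input, and the 10 ms budget are illustrative assumptions, not part of the discussion above:

```python
import time
import torch
import torch.nn as nn

# Toy stand-in for a trained DNN (illustrative; any trained model fits here).
model = nn.Sequential(nn.Linear(128, 256), nn.ReLU(), nn.Linear(256, 10))
model.eval()

LATENCY_BUDGET_MS = 10.0  # assumed per-sample budget ("tens of milliseconds")

def stream_of_inputs(n=100):
    """Simulate a continuous stream of input samples."""
    for _ in range(n):
        yield torch.randn(1, 128)

with torch.no_grad():  # inference only: skip autograd bookkeeping
    for sample in stream_of_inputs():
        start = time.perf_counter()
        output = model(sample)
        elapsed_ms = (time.perf_counter() - start) * 1000.0
        if elapsed_ms > LATENCY_BUDGET_MS:
            print(f"latency budget exceeded: {elapsed_ms:.2f} ms")
        # `output` is used directly or fed into downstream systems
```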
We are exploring ways to integrate GPMA with Seastar, which could make it very useful: the goal is fast real-time inference of GNN models on dynamic graphs, with minimal latency and fast update times. A rough sketch of the intended update-then-infer loop is below.
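This is only a sketch under stated assumptions: `DynamicGraphStub` and `gnn_layer` are hypothetical stand-ins (a real GPMA keeps edges in a packed memory array on the GPU for cheap batched updates, and Seastar has its own vertex-centric API), used here just to show the control flow of applying streamed edge updates and then running inference on the fresh graph:

```python
import torch

class DynamicGraphStub:
    """Hypothetical stand-in for a GPMA-backed dynamic graph.

    A real GPMA would apply batched edge insertions in parallel on the GPU;
    here a dense adjacency matrix keeps the sketch runnable.
    """

    def __init__(self, num_nodes):
        self.adj = torch.zeros(num_nodes, num_nodes)

    def insert_edges(self, edges):
        # Batched insert: GPMA's fast update path.
        src, dst = edges[:, 0], edges[:, 1]
        self.adj[src, dst] = 1.0


def gnn_layer(adj, features, weight):
    """One mean-aggregation GNN layer (illustrative, not Seastar's API)."""
    deg = adj.sum(dim=1, keepdim=True).clamp(min=1.0)
    return torch.relu((adj @ features / deg) @ weight)


num_nodes, feat_dim = 1000, 64
graph = DynamicGraphStub(num_nodes)
features = torch.randn(num_nodes, feat_dim)
weight = torch.randn(feat_dim, feat_dim)

# The envisioned loop: apply a batch of streamed edge updates, then run
# GNN inference on the freshly updated graph with minimal latency.
for step in range(5):
    update_batch = torch.randint(0, num_nodes, (256, 2))  # streamed edges
    graph.insert_edges(update_batch)                      # fast update (GPMA)
    embeddings = gnn_layer(graph.adj, features, weight)   # inference (GNN)
```

The point of the design is that updates and inference share one GPU-resident graph structure, so there is no host-device copy between an edge batch arriving and the next inference pass.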