This benchmark evaluates the performance improvements achieved by integrating Tritonserver-rs into a video processing pipeline.
The benchmark uses models to:
- Detect objects in a video frame.
- Perform regression tasks on the detected objects.
The test video resolution was Full HD (1920x1080). The models used for inference were lightweight, but the frame throughput of the pipeline was very high, simulating a demanding real-world application where processing efficiency is critical.
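For illustration, here is a minimal, self-contained sketch of the two-stage detect-then-regress loop and of how an average-FPS figure like those in the results table can be measured. Every type and function in it is a hypothetical stand-in, not the Tritonserver-rs API:

```rust
use std::time::Instant;

// Hypothetical stand-ins for the real pipeline stages; the actual
// benchmark runs real detection and regression models on the GPU.
struct Frame;       // one decoded Full HD frame
struct Detection;   // one detected object

fn detect(_frame: &Frame) -> Vec<Detection> {
    vec![Detection] // stub: pretend one object was found
}

fn regress(_object: &Detection) -> f32 {
    0.0             // stub: regression output per detected object
}

fn main() {
    let frames: Vec<Frame> = (0..1_000).map(|_| Frame).collect();

    let start = Instant::now();
    for frame in &frames {
        // Stage 1: object detection on the full frame.
        let objects = detect(frame);
        // Stage 2: regression on each detected object.
        for object in &objects {
            let _value = regress(object);
        }
    }
    let elapsed = start.elapsed().as_secs_f64();

    // Average FPS, as reported in the results table. (With these empty
    // stubs the loop finishes almost instantly, so the printed number is
    // meaningless; real model calls go where the stubs are.)
    println!("avg FPS: {:.1}", frames.len() as f64 / elapsed);
}
```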
The models were executed using four different methods:
- Dedicated Triton Server: Requests sent via gRPC to a standalone server (the "Triton (gRPC)" column below).
- Python Triton Library: Direct CUDA memory transfer via shared memory (the "Triton (Shared Memory)" column below).
- Tritonserver-rs: Local, in-process execution inside the application; see the data-path sketch after this list.
- DeepStream SDK: NVIDIA's framework optimized for video pipelines.
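To make the methods easier to compare, the sketch below contrasts the per-frame data path of a remote gRPC request (which copies and serializes every frame) with in-process execution (which can hand the model a reference to the buffer it already owns). The function names and types are illustrative assumptions, not the real Triton or Tritonserver-rs APIs:

```rust
// Conceptual comparison of per-frame data paths; not real Triton APIs.
// A Full HD RGB frame is ~6 MB, so copies and serialization add up
// quickly at high frame rates.

type FrameBuffer = Vec<u8>; // decoded frame pixels already in memory

// Remote server path (gRPC): the frame is copied into a request
// message, serialized, sent over a socket, and copied again server-side.
fn infer_over_grpc(frame: &FrameBuffer) -> Vec<f32> {
    let request_payload: Vec<u8> = frame.clone(); // extra copy per frame
    // ... serialize `request_payload`, send it, await the response ...
    let _ = request_payload;
    vec![]
}

// In-process path (local execution): the model runs in the same
// process, so the existing buffer is passed by reference, with no
// serialization and no network hop.
fn infer_in_process(frame: &FrameBuffer) -> Vec<f32> {
    let _input: &[u8] = frame.as_slice(); // zero-copy view of the frame
    // ... run inference directly on this buffer ...
    vec![]
}

fn main() {
    let frame: FrameBuffer = vec![0u8; 1920 * 1080 * 3]; // one Full HD frame
    let _ = infer_over_grpc(&frame);
    let _ = infer_in_process(&frame);
}
```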
Tests were conducted across various GPUs, and the table below shows the average frames per second (FPS) processed by each method:
| GPU | Triton (gRPC) | Triton (Shared Memory) | Tritonserver-rs | DeepStream SDK |
|---|---|---|---|---|
| Tesla T4 | 70 | 105 | 200 | 320 |
| RTX 3090 Ti | 80 | 140 | 360 | 450 |
| A10 | 75 | 115 | 270 | 330 |
| A100 | 80 | 130 | 330 | 400 |
Performance Gains:
- Tritonserver-rs outperformed the dedicated Triton Server with gRPC-based communication by 3–4x (for example, 270 vs. 75 FPS on the A10).
- Compared to the Python Triton library with shared memory, Tritonserver-rs delivered roughly 2x the throughput (for example, 200 vs. 105 FPS on the Tesla T4).
Comparison with DeepStream:
While the DeepStream SDK achieves the highest FPS due to its specialization in video processing, this comes at the cost of flexibility and broader model support. Tritonserver-rs offers a balanced trade-off, combining significant performance improvements with the flexibility to serve a wide range of models and use cases.
Hardware Agnosticism:
Model execution on different GPUs required no additional configuration, demonstrating the adaptability and ease of deployment of Tritonserver-rs across a wide range of hardware.