Accelerating Inference with NVIDIA Triton Inference Server and NVIDIA DALI

Originally published at: https://developer.nvidia.com/blog/accelerating-inference-with-triton-inference-server-and-dali/

When you are working on optimizing inference scenarios for the best performance, you may underestimate the effect of data preprocessing: the operations required before forwarding an input sample through the model. This post highlights the impact of data preprocessing on inference performance and how you can easily speed it up on the…

The article explains very well how to decode images on the server side, and it worked like a charm, but I am struggling to find a similar approach for videos.

Inference on video datasets is even more network intensive. With tools like ffmpeg-python, I am able to encode 32/64-frame sequences into an H.264 byte stream (roughly as sketched below). It would be great if this stream could be decoded by DALI on the server side to produce the required NTHWC tensor.
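For context, this is roughly what I do on the client side today; the frame shape, frame rate, and codec settings are illustrative placeholders, not my exact configuration:

```python
import ffmpeg
import numpy as np

# A 32-frame RGB sequence standing in for real input; the shape, frame rate,
# and codec settings here are illustrative only.
frames = np.random.randint(0, 255, (32, 224, 224, 3), dtype=np.uint8)

# Pipe the raw frames through ffmpeg and capture the H.264 elementary stream.
encoded, _ = (
    ffmpeg
    .input("pipe:", format="rawvideo", pix_fmt="rgb24", s="224x224", framerate=30)
    .output("pipe:", format="h264", vcodec="libx264")
    .run(input=frames.tobytes(), capture_stdout=True, quiet=True)
)
# `encoded` holds the bytes I would like to send in the Triton request.
```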
After spending hours searching, I could only find video readers that read from files (like the pipeline sketched below), but nothing that can decode video coming from external_source.
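For reference, this is the kind of file-based pipeline I mean; the filename and sequence length are placeholders:

```python
from nvidia.dali import pipeline_def
import nvidia.dali.fn as fn

# File-based video reader: it decodes on the GPU, but only takes file paths.
# "clip.mp4" and sequence_length=32 are placeholders for my use case.
@pipeline_def(batch_size=1, num_threads=2, device_id=0)
def video_pipe():
    return fn.readers.video(
        device="gpu",
        filenames=["clip.mp4"],
        sequence_length=32,  # one 32-frame window per sample
    )

pipe = video_pipe()
pipe.build()
(sequences,) = pipe.run()  # batched FHWC sequences, i.e. the NTHWC layout I need
```

What I am missing is a way to feed the encoded byte stream itself, e.g. through fn.external_source, instead of a file path.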

Hi @sufiyan,

Thank you for checking out DALI. That is true: it doesn’t currently support video decoding with Triton.
What you can do is check out DeepStream, which is designed to handle video streaming and provides integration with Triton.
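As a rough sketch of that route, using GStreamer’s Python bindings: DeepStream’s hardware decoder handles the H.264 stream and its Gst-nvinferserver plugin forwards batched frames to a Triton model. The element parameters and config file path below are placeholders, not a tested setup:

```python
import gi
gi.require_version("Gst", "1.0")
from gi.repository import Gst

Gst.init(None)

# Decode H.264 with DeepStream's hardware decoder, batch frames with
# nvstreammux, and let nvinferserver hand them to a Triton model.
pipeline = Gst.parse_launch(
    "filesrc location=sample.h264 ! h264parse ! nvv4l2decoder ! "
    "mux.sink_0 nvstreammux name=mux batch-size=1 width=1280 height=720 ! "
    "nvinferserver config-file-path=config_infer_triton.txt ! fakesink"
)

pipeline.set_state(Gst.State.PLAYING)
bus = pipeline.get_bus()
# Block until the stream ends or an error is raised.
bus.timed_pop_filtered(Gst.CLOCK_TIME_NONE,
                       Gst.MessageType.EOS | Gst.MessageType.ERROR)
pipeline.set_state(Gst.State.NULL)
```

The nvinferserver config file is where the Triton model repository and model name would be specified.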