Accelerating Inference with NVIDIA Triton Inference Server and NVIDIA DALI

Originally published at: Accelerating Inference with NVIDIA Triton Inference Server and NVIDIA DALI | NVIDIA Developer Blog

When you optimize inference scenarios for the best performance, it is easy to underestimate the effect of data preprocessing, that is, the operations required before forwarding an input sample through the model. This post highlights the impact of data preprocessing on inference performance and how you can easily speed it up on the…