Simplifying AI Inference in Production with NVIDIA Triton

jwitsoe · April 12, 2021, 5:00pm

Originally published at: https://developer.nvidia.com/blog/simplifying-ai-inference-in-production-with-triton/

AI machine learning is unlocking breakthrough applications in fields such as online product recommendations, image classification, chatbots, forecasting, and manufacturing quality inspection. There are two parts to AI: training and inference. Inference is the production phase of AI. The trained model and associated code are deployed in the data center or public cloud, or at…

tgt_kz · June 17, 2021, 8:10am

Is it possible to use Triton server for real-time object detection and recognition from video?
Or would TensorRT or something else be more suitable?

shankarc · June 22, 2021, 12:56am

Have you looked at the NVIDIA DeepStream SDK? Triton and TensorRT are integrated into it, and DeepStream has a variety of features for real-time object detection from video streams.

tgt_kz · November 19, 2021, 6:53am

Thank you @shankarc !
I missed a notification in my inbox. Yeah, we are converting everything to DeepStream now.

