Originally published at: https://developer.nvidia.com/blog/deploying-ai-deep-learning-models-with-triton-inference-server/
In machine learning, models are trained on existing datasets and then deployed to run inference on new data. In a previous post, Simplifying and Scaling Inference Serving with NVIDIA Triton 2.3, we discussed the inference workflow and the need for an efficient inference serving solution. In that post, we introduced Triton Inference Server.