Video Tutorial: Accelerating Inference Performance of Recommendation Systems with TensorRT

Originally published at: Video Tutorial: Accelerating Inference Performance of Recommendation Systems with TensorRT | NVIDIA Technical Blog

NVIDIA TensorRT is a high-performance deep learning inference optimizer and runtime that delivers low latency and high throughput for deep learning inference applications. You can import trained models from all major deep learning frameworks into TensorRT and easily build highly efficient inference engines that can be incorporated into larger applications and services. This video demonstrates the steps…