Optimizing and Accelerating AI Inference with the TensorRT Container from NVIDIA NGC

Originally published at: Optimizing and Accelerating AI Inference with the TensorRT Container from NVIDIA NGC | NVIDIA Technical Blog

Natural language processing (NLP) is one of the most challenging tasks for AI because it must understand context, phonetics, and accent to convert human speech into text. Building this AI workflow starts with training a model that can understand spoken language and process it into text. BERT is one of the best models for this…