NVIDIA Announces TensorRT 6; Breaks 10 millisecond barrier for BERT-Large

Originally published at: NVIDIA Announces TensorRT 6; Breaks 10 millisecond barrier for BERT-Large | NVIDIA Technical Blog

Today, NVIDIA released TensorRT 6 which includes new capabilities that dramatically accelerate conversational AI applications, speech recognition, 3D image segmentation for medical applications, as well as image-based applications in industrial automation.  TensorRT is a high-performance deep learning inference optimizer and runtime that delivers low latency, high-throughput inference for AI applications.  With today’s release, TensorRT continues…