Originally published at: NVIDIA Announces TensorRT 6; Breaks 10 millisecond barrier for BERT-Large | NVIDIA Technical Blog
Today, NVIDIA released TensorRT 6 which includes new capabilities that dramatically accelerate conversational AI applications, speech recognition, 3D image segmentation for medical applications, as well as image-based applications in industrial automation. TensorRT is a high-performance deep learning inference optimizer and runtime that delivers low latency, high-throughput inference for AI applications. With today’s release, TensorRT continues…