TensorRT 4 Accelerates Neural Machine Translation, Recommenders, and Speech

Originally published at: https://developer.nvidia.com/blog/tensorrt-4-accelerates-translation-speech-recommender/

NVIDIA has released TensorRT 4 at CVPR 2018. This new version of TensorRT, NVIDIA’s powerful inference optimizer and runtime engine provides: New Recurrent Neural Network (RNN) layers for Neural Machine Translation apps New Multilayer perceptron (MLP) operations and optimizations for Recommender Systems Native ONNX parser to import models from popular deep learning frameworks Integration with TensorFlow…