Neural Machine Translation Inference with TensorRT 4

Originally published at:

Neural machine translation is used across a wide variety of consumer applications, including websites, road signs, foreign-language subtitle generation, and more. TensorRT, NVIDIA’s programmable inference accelerator, helps optimize and generate runtime engines for deploying deep learning inference apps to production environments. NVIDIA released TensorRT 4 with new features to accelerate inference of neural machine…

Hi, which GPU did this blog use? The blog says at the beginning that 'Google’s Neural Machine Translation (GNMT) model performed inference 60x faster using TensorRT on Tesla V100 GPUs compared to CPU-only platforms', but at the end it lists '1 GPU: Tesla P4 (GP104), Driver=r384.125, CPU = E5-2690 v4@2.60GHz 3.5GHz Turbo (Broadwell), HT On, Threads=56, Sockets=2, FP32. CPU-only configuration: Skylake Gold 6140@2.30GHz 3.7GHz Turbo (Skylake); HT Off; Sockets: 2; Threads: 36, FP32'.

The detailed machine spec at the end of the blog corresponds to the sampleNMT measurement, which was performed on a Tesla P4 GPU.
The GNMT performance figure was measured on a Tesla V100 GPU, as you mentioned above. Hope that clarifies.