SampleNMT computes incorrect attention vectors in TensorRT 5.0.2.6

zhihao5xgp6 · January 4, 2019, 5:02pm

It seems that sampleNMT incorrectly computes the attention vectors from the context vectors. There should be a tanh non-linear activation layer after W[c:h], as shown in Equation 3 GitHub - tensorflow/nmt: TensorFlow Neural Machine Translation Tutorial. To fix the bug, you should add a tanh activation at the end of SLPAttention::addToModel.

NVES · January 4, 2019, 6:10pm

Thank you for the feedback. I’ll bring this to our engineering team’s attention.

NVES · January 10, 2019, 3:45pm

Hello,

our engineers have committed the fix (Add missing tanh to sampleNMT attention) and should be available in a future release.

thank you

Topic		Replies	Views
sample_nmt segmentation fault TensorRT	2	811	July 10, 2018
Introduction to Neural Machine Translation with GPUs (part 3) Technical Blog	36	830	August 19, 2018
TF-TRT RNN NMT model optimise, Input tensor with shape [?,?] TensorRT	0	648	May 29, 2019
Activation layer TensorRT	0	397	June 13, 2018
Customize TensorRT 4 LSTM TensorRT	3	2616	September 11, 2018
Use TenserRT2.1 for LSTM layer with peephole and projection GPU-Accelerated Libraries	0	498	September 21, 2017
I don't get similar results with TensorRT and the trained tensorflow model! Jetson TX2	20	4600	October 18, 2021
TensorRT3 gives wrong result for alphago model GPU-Accelerated Libraries	0	468	January 16, 2018
tensorRT transpose and RNN TensorRT	0	543	April 18, 2019
TRT Error Repeated tensor name: AttentionOcr_v1/sequence_logit_fn/SQLR/LSTM/attention_decoder/lstm_cell/split_1 TensorRT	11	1127	July 30, 2020

SampleNMT computes incorrect attention vectors in TensorRT 5.0.2.6

Related topics