TensorRT 4.0.1 for amd64 Vs. TensorRT 4.0.2 for TX2

I’m interested in knowing the differences between these two TensorRT versions from the following points of view:

  1. IEEE-754 standard usage
  2. FMA usage
  3. Fast mode usage
  4. FP Optimizations

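Item 2 in particular can produce bit-level differences on its own: an FMA computes a*b+c with a single rounding, while a separate multiply and add rounds twice. A minimal sketch of this effect in FP32 (this emulates FMA's single rounding by doing the exact product in float64, which is an assumption of the illustration, not how TensorRT computes anything):

```python
import numpy as np

# A value chosen so that the product 1 + 2**-11 + 2**-24 cannot be
# represented exactly in FP32 (ulp near 1.0 is 2**-23).
a = np.float32(1.0) + np.float32(2.0**-12)

# Two roundings (no FMA): the product is rounded to FP32 first,
# losing the 2**-24 bit, then 1 is subtracted.
no_fma = a * a - np.float32(1.0)          # 2**-11 exactly

# Single rounding (FMA-like): the product is exact in float64
# (24-bit x 24-bit fits in a 53-bit significand), and the result
# is rounded to FP32 only once at the end.
fma_like = np.float32(np.float64(a) * np.float64(a) - 1.0)  # 2**-11 + 2**-24

print(no_fma, fma_like, no_fma == fma_like)
```

The two results differ in the last bits, and such differences accumulate layer by layer through a CNN, so different FMA/fast-math behavior between builds is a plausible source of output divergence.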
I’m asking because I’m running the same CNN model, developed with TensorFlow, on both platforms:

  1. amd64 with Ubuntu 16.04, TensorRT 4.0.1, cuDNN 7.1.4, CUDA 9.0, GeForce 1080, display driver 410.72
  2. TX2, JetPack 3.2.1 with TensorRT 4.0.2, cuDNN 7.1.5, CUDA 9.0

Although both run properly and the final detection parameters are equal, when I compare the model binary outputs (32-bit FP) I see a significant accuracy gap between them.

I am also able to run the model without TensorRT, directly via the TensorFlow C++ API and cuDNN.
When I compare the TensorFlow binary outputs generated on both platforms, I do not see this accuracy gap at all.
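For reference, the kind of comparison described above could be sketched like this (the filenames and the raw-FP32-dump format are hypothetical; substitute however you actually export the layer outputs):

```python
import numpy as np

def compare_outputs(a, b, eps=1e-12):
    """Return (max absolute diff, max relative diff) between two FP32 tensors."""
    a = np.asarray(a, dtype=np.float32).ravel()
    b = np.asarray(b, dtype=np.float32).ravel()
    abs_diff = np.abs(a - b)
    rel_diff = abs_diff / (np.abs(a) + eps)  # eps avoids division by zero
    return float(abs_diff.max()), float(rel_diff.max())

# Hypothetical raw FP32 dumps of the final layer output from each platform:
# amd64_out = np.fromfile("trt_amd64_output.bin", dtype=np.float32)
# tx2_out   = np.fromfile("trt_tx2_output.bin", dtype=np.float32)
# print(compare_outputs(amd64_out, tx2_out))
```

Reporting both the absolute and relative maxima helps distinguish ordinary last-ulp rounding noise from a genuine accuracy gap.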

Please advise.


This is not expected. To help us debug, can you please share a small repro containing how you converted the TF CNN model to TRT, the inference code, the CNN model, and inference test cases that demonstrate the accuracy gap you are seeing?

NVIDIA Enterprise Support.