TensorRT gives different results than ONNX and PyTorch

erez.h · July 17, 2023, 4:25pm

Description

When I create a TensorRT engine from an ONNX file and compare the inference outputs of the two formats, I get different results. The difference is significant and is not explained by precision or optimizations.
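For a rough sense of scale, the sketch below measures how much error a single FP32 to FP16 round trip introduces. The value range [0, 640] is an assumption (the thread does not state the output range; 640 is only the model's input size), and the helper name fp16_roundtrip_error is hypothetical. It does not model error accumulated across layers, so it gives only a loose intuition, not a bound on what an FP16 engine can produce.

```python
import numpy as np

def fp16_roundtrip_error(values):
    """Absolute error from casting FP32 values to FP16 and back (hypothetical helper)."""
    x = np.asarray(values, dtype=np.float32)
    return np.abs(x - x.astype(np.float16).astype(np.float32))

# Assumed value range: 0..640 (not stated in the thread).
sample = np.linspace(0.0, 640.0, num=10001, dtype=np.float32)
err = fp16_roundtrip_error(sample)

# FP16 has a 10-bit mantissa, so values in [512, 1024) are spaced 0.5 apart
# and a single rounding is off by at most 0.25.
print("max FP16 round-trip error on [0, 640]:", float(err.max()))
```

A reported max diff of 6.824637 is far larger than this single-rounding error, which is consistent with the observation that removing --fp16 does not change the result.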

Environment

TensorRT Version: 8.6.1.0
GPU Type: NVIDIA RTX A3000
Nvidia Driver Version: 535.54.03
CUDA Version: 12.2
CUDNN Version: 8.9.2
Operating System + Version: Ubuntu 20.04
Python Version (if applicable): 3.8.10
TensorFlow Version (if applicable): N/A
PyTorch Version (if applicable): 2.0.1+cu118
Baremetal or Container (if container which image + tag): N/A

Relevant Files

model_640x640_FULL_STAT.onnx - ONNX file
model_640x640_fp16_stat.engine - TensorRT engine file
comp_onnx_trt.py - Python script that loads both formats and compares their results.
zidane.jpg - file used as input to the networks.

Steps To Reproduce

1. Build the TensorRT engine from the ONNX file:

trtexec --onnx=model_640x640_FULL_STAT.onnx --saveEngine=model_640x640_fp16_stat.engine --useSpinWait --fp16
(I also tried without --fp16.)
2. Run the attached Python script (comp_onnx_trt.py). It loads the ONNX and TensorRT models, feeds the same attached image file to both, compares the results, and prints the maximum difference between them.
3. The printed result is: max diff: 6.824637
4. When I compare the ONNX results to PyTorch using the same input, the difference is low (~0.01).
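The comparison in step 2 can be sketched as follows. This is not the attached comp_onnx_trt.py, whose contents are not shown in the thread; it is a minimal NumPy-only illustration. The function name compare_outputs, the tolerances, and the stand-in arrays are all assumptions. In a real run, the two arrays would come from the ONNX and TensorRT inference calls.

```python
import numpy as np

def compare_outputs(reference, candidate, rtol=1e-3, atol=1e-3):
    """Summarize how far `candidate` deviates from `reference` (hypothetical helper)."""
    ref = np.asarray(reference, dtype=np.float32).ravel()
    cand = np.asarray(candidate, dtype=np.float32).ravel()
    if ref.shape != cand.shape:
        raise ValueError(f"size mismatch: {ref.shape} vs {cand.shape}")
    abs_diff = np.abs(ref - cand)
    worst = int(abs_diff.argmax())  # flat index of the largest deviation
    return {
        "max_abs_diff": float(abs_diff.max()),
        "mean_abs_diff": float(abs_diff.mean()),
        "worst_index": worst,
        "ref_at_worst": float(ref[worst]),
        "cand_at_worst": float(cand[worst]),
        "allclose": bool(np.allclose(cand, ref, rtol=rtol, atol=atol)),
    }

# Stand-in data (assumption): random values instead of real model outputs.
rng = np.random.default_rng(0)
onnx_out = rng.normal(size=(1, 100, 6)).astype(np.float32)
trt_out = onnx_out.copy()
trt_out[0, 10, 2] += 6.8  # inject one large deviation for illustration

report = compare_outputs(onnx_out, trt_out)
for key, value in report.items():
    print(f"{key}: {value}")
```

Reporting the mean difference and the location of the worst element alongside the maximum helps distinguish a single outlier from a mismatch that affects the whole output tensor.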

comp_onnx_trt.py (3.1 KB)
model_640x640_fp16_stat.engine (5.2 MB)
model_640x640_FULL_STAT.onnx (7.9 MB)

spolisetty · July 24, 2023, 6:38am

Hi @erez.h,

We could reproduce the issue. This is a known issue and will be fixed in a future major release. As a temporary workaround, we recommend using FP32 precision.

Thank you.

kalecikli1478 · July 24, 2023, 8:16am

Can we get the engine output with the .py file in this attachment? Also, when I run the command, I get the error: tensorrt trtexec (tensorrt v8502)usr/src/… Is there a solution?
Note: what model did you use exactly?

erez.h · July 30, 2023, 7:44am

Hi,
I also tried using FP32 precision (removing the “--fp16” flag) and the results are the same.
Do I need to use a different configuration for trtexec than “trtexec --onnx=model_640x640_FULL_STAT.onnx --saveEngine=model_640x640_stat.engine --useSpinWait”?
Thanks,
Erez

spolisetty · August 2, 2023, 12:08pm

The above command should work fine.
If we don’t specify precision, TensorRT will use the “FP32” precision by default.

erez.h · August 3, 2023, 4:42am

Thanks.
As I wrote, I already tried this and got the same results.

erez.h · August 10, 2023, 5:48am

Hi
Can you please check if you have a valid workaround for this issue?
Also, when do you expect a version with a fix to be released?

daniel.massonfurlan · September 5, 2023, 7:31am

Hi!

I have the same problem.

There is a large difference between the PyTorch and TensorRT outputs. It would be great to have some feedback on upcoming releases.

grydenisbak · September 28, 2023, 11:25am

Hello. I have the same problem.

TensorRT Version : 8.6.1.0
GPU Type : NVIDIA 2060
Nvidia Driver Version : 470.182.03
CUDA Version : 11.4
CUDNN Version : 8.4.0
Operating System + Version : Ubuntu 20.04
Python Version (if applicable) : 3.10.12
TensorFlow Version (if applicable) : N/A
PyTorch Version (if applicable) : 1.12.1+cu113
Baremetal or Container (if container which image + tag) : N/A

