A model converted from ONNX to TensorRT does not produce the correct inference results. I converted a dynamically shaped TF pose-estimation model (MoveNet multipose) to ONNX using the tf2onnx converter, and verified that the ONNX model gives the same output as the TF model. I then converted the ONNX model to TRT format using trtexec. The generated TRT model does not give the same output as the ONNX model: the output has the correct shape, but the values are incorrect.
Can you please provide guidance to try and get the TRT model working correctly?
TensorRT Version: 18.104.22.168
GPU Type: RTX 3060 (Laptop)
Nvidia Driver Version: Studio Driver 522.30
CUDA Version: 11.7 & 11.8
CUDNN Version: 8.6.0
Operating System + Version: Windows 10 (19044.2251)
Python Version (if applicable): 3.9.6
Baremetal or Container (if container which image + tag): Bare metal
The files necessary to reproduce this are in this Google Drive
Steps To Reproduce
- Download the ONNX model to the working directory.
- Use trtexec as follows: "C:\Program Files\NVIDIA\TensorRT\v22.214.171.124\bin\trtexec.exe" --onnx=model.onnx --saveEngine=model.trt --shapes=input:1x192x256x3
- Run the test script with both models on a test image whose shape is close to 192x256.
- The two models produce different results.
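To put a number on "different results" before involving other tools, a small helper like the one below can summarize how far apart the two outputs are. This is only a sketch: `trt_out` and `onnx_out` are stand-ins for the arrays your test script gets back from each runtime, and the output shape used here is just illustrative.

```python
import numpy as np

def report_diff(trt_out, onnx_out):
    """Print and return the max absolute/relative difference between two outputs."""
    a = np.asarray(trt_out, dtype=np.float64)
    b = np.asarray(onnx_out, dtype=np.float64)
    abs_diff = np.abs(a - b)
    # Relative difference, guarding against division by zero
    rel_diff = abs_diff / np.maximum(np.abs(b), np.finfo(np.float64).tiny)
    print(f"max abs diff: {abs_diff.max():.3e}")
    print(f"max rel diff: {rel_diff.max():.3e}")
    return abs_diff.max(), rel_diff.max()

# Stand-ins for the arrays returned by the two runtimes
rng = np.random.default_rng(0)
onnx_out = rng.random((1, 6, 56), dtype=np.float32)
trt_out = onnx_out + 1e-7  # a tiny perturbation, i.e. a near-match
max_abs, max_rel = report_diff(trt_out, onnx_out)
```

Running this on the real outputs makes it obvious whether you are looking at small numerical drift (e.g. 1e-6) or genuinely wrong values (e.g. 1e-1 and above), which are debugged very differently.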
We are unable to run the script you shared from the command line; we are hitting Qt-related errors. However, we could reproduce the accuracy difference between TensorRT and ONNX-Runtime. Please allow us some time to work on this.
[V] Loaded Module: tensorrt | Version: 126.96.36.199 | Path: ['/usr/local/lib/python3.8/dist-packages/tensorrt']
[I] Accuracy Comparison | trt-runner-N0-11/15/22-05:35:13 vs. onnxrt-runner-N0-11/15/22-05:35:13
[E] FAILED | Output: 'output_0' | Difference exceeds tolerance (rel=1e-05, abs=1e-05)
[E] FAILED | Mismatched outputs: ['output_0']
[E] Accuracy Summary | trt-runner-N0-11/15/22-05:35:13 vs. onnxrt-runner-N0-11/15/22-05:35:13 | Passed: 0/1 iterations | Pass Rate: 0.0%
[E] FAILED | Runtime: 68.875s | Command: /usr/local/bin/polygraphy run model.onnx --trt --onnxrt --workspace=200G --verbose
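For context on what the log's `rel=1e-05, abs=1e-05` failure means: a common elementwise criterion for such tolerances is the one numpy's `isclose` uses, where an element passes when |out − ref| ≤ abs + rel·|ref|. The values below are hypothetical, just to show how a single out-of-tolerance element fails the whole output:

```python
import numpy as np

atol, rtol = 1e-5, 1e-5  # the tolerances reported in the log above

ref = np.array([0.91, 0.52, 0.13], dtype=np.float32)  # hypothetical ONNX-Runtime values
out = np.array([0.91, 0.48, 0.13], dtype=np.float32)  # hypothetical TensorRT values

# Elementwise criterion: |out - ref| <= atol + rtol * |ref|
passed = np.isclose(out, ref, rtol=rtol, atol=atol)
print(passed)              # the middle element is far outside tolerance
print(bool(passed.all()))  # so the comparison as a whole fails
```

Note these defaults are quite strict for FP32 GPU inference, so looking at the actual magnitude of the mismatch matters: tolerance-level noise and wholesale wrong values have different causes.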
Thank you for getting back to me.
I received an email (doesn’t seem to be shown on this thread for some reason) suggesting I check out the following documentation:
and recommending that I “write new plugins or refactor existing ones to target the IPluginV2DynamicExt or IPluginV2IOExt interfaces instead.”
I’m not sure how this helps me - could you please elaborate a little? I’m relatively new to deep learning in general, let alone TensorRT, so any extra guidance would be greatly appreciated!
Did you manage to make any progress with my issue?
Make sure you are loading the correctly trained model.
Another thing you can try is running your code on a simpler model that you know works.