Tensorrt8.5 inference different with origin onnx model

liwenjudetiankong · October 24, 2022, 7:37am

Description

I trained a model with PyTorch and exported it to ONNX format. The ONNX model predicts correct values, but after I convert it to a TensorRT engine, inference gives wrong results.
I used Polygraphy to check the difference between the two. My command is:
polygraphy run detector_corrector.onnx --trt --onnxrt --workspace=24G --verbose --fail-fast

Here is the output:
test_diff.output (236.2 KB)

I am using this TensorRT container image:
nvcr.io/nvidia/tensorrt 22.09-py3 dcf7a448b6aa

Here is my model:

Environment

TensorRT Version: 8.5.0.12
GPU Type: NVIDIA GeForce RTX 3090
Nvidia Driver Version: 470.129.06
CUDA Version: cuda_11.8.r11.8
CUDNN Version: 8
Operating System + Version: Ubuntu 20.04.5 LTS
Python Version (if applicable): 3.8.10
TensorFlow Version (if applicable):
PyTorch Version (if applicable): 1.12.1
Baremetal or Container (if container which image + tag):

AakankshaS · October 28, 2022, 7:45am

Hi @liwenjudetiankong ,
Apologies for the delayed response.
Please allow us some time to check on this.
Thank you for your patience.

spolisetty · November 29, 2022, 5:19am

Hi @liwenjudetiankong,

Could you please elaborate on the issue highlighted above?
Are you looking for 100% numerical accuracy in comparison with ONNX-Runtime results?
Or do you have a benchmarking metric for inference accuracy?

Our engineers think the Polygraphy failure is just a tolerance issue. With "--atol 1e-4 --rtol 1e-4" the Polygraphy check passes, whereas the default is "--atol 1e-5 --rtol 1e-5".
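
To illustrate why a looser tolerance can turn a failing check into a passing one, here is a minimal sketch. It assumes a NumPy-style closeness rule (absolute difference at most atol + rtol * |reference|), which may not be exactly how Polygraphy combines the two tolerances. The output values are made up for illustration and are not taken from the attached model.

```python
def within_tolerance(reference, candidate, atol, rtol):
    """Assumed rule: |candidate - reference| <= atol + rtol * |reference| for every element."""
    return all(
        abs(c - r) <= atol + rtol * abs(r)
        for r, c in zip(reference, candidate)
    )


def max_abs_diff(reference, candidate):
    """Largest element-wise absolute difference between the two outputs."""
    return max(abs(c - r) for r, c in zip(reference, candidate))


# Hypothetical values standing in for ONNX Runtime and TensorRT outputs.
onnx_out = [0.5, 1.0, 2.0]
trt_out = [0.50004, 0.99994, 2.00008]

print("max abs diff:", max_abs_diff(onnx_out, trt_out))
print("passes at 1e-5:", within_tolerance(onnx_out, trt_out, atol=1e-5, rtol=1e-5))
print("passes at 1e-4:", within_tolerance(onnx_out, trt_out, atol=1e-4, rtol=1e-4))
```

Differences of a few 1e-5 fail the stricter setting and pass the looser one, so a failed check at the default tolerance does not by itself show that the final predictions differ.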

Thank you.

liwenjudetiankong · December 5, 2022, 3:46am

Please refer to this post: Tensorrt8.5 inference different with origin onnx model
I have uploaded test code; the ONNX and TRT runtimes give totally different results.

spolisetty · December 6, 2022, 5:38am

Hi @liwenjudetiankong,

Could you also please provide more information on the points requested above?

Thank you.

spolisetty · January 23, 2023, 5:25am

Based on our debugging, we do not think this is a TRT issue.
It is simply a floating-point arithmetic error; the user should include a postprocessing script in the code and run it on a real dataset, comparing the output sentence rather than the numeric output.
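
As a rough sketch of what comparing the output sentence could look like, the snippet below decodes two sets of logits greedily and compares the resulting strings. The vocabulary, the logits, and the `decode` helper are all hypothetical; the real postprocessing for this model is not shown in the thread and may differ.

```python
def decode(logits, vocab):
    """Greedy decode: pick the highest-scoring token at each position."""
    token_ids = [max(range(len(row)), key=row.__getitem__) for row in logits]
    return "".join(vocab[i] for i in token_ids)


# Hypothetical vocabulary and logits (one row per output position).
vocab = ["a", "b", "c"]
onnx_logits = [[2.0, 0.1, -1.0], [0.3, 1.7, 0.2], [-0.5, 0.0, 3.1]]
trt_logits = [[1.9998, 0.1003, -1.0001], [0.3002, 1.6997, 0.2001], [-0.4999, 0.0002, 3.0996]]

onnx_sentence = decode(onnx_logits, vocab)
trt_sentence = decode(trt_logits, vocab)

# The raw numbers differ slightly, but the decoded sentences can still match.
print(onnx_sentence, trt_sentence, onnx_sentence == trt_sentence)
```

Running the same comparison over a real dataset and counting how many sentences match gives a task-level accuracy figure rather than an element-wise numeric one.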

Thank you.
