Description
I converted the parseq OCR model from PyTorch to ONNX and tested the ONNX model, and everything is OK. But when I convert the ONNX model to an FP32 or FP16 TensorRT engine, the output is very different from the ONNX model's.
I use onnxsim to simplify the ONNX model; if I don't use onnxsim, all results are NaN.
model repo : https://github.com/baudm/parseq (Scene Text Recognition with Permuted Autoregressive Sequence Models, ECCV 2022)
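For context, a minimal sketch of such an export-and-simplify flow (the torch.hub call, tensor names, and paths are illustrative assumptions, not the exact script used here):

import torch
import onnx
from onnxsim import simplify

# Load a pretrained parseq model via torch.hub (assumes network access).
model = torch.hub.load('baudm/parseq', 'parseq', pretrained=True).eval()

# parseq's default input resolution is 3 x 32 x 128; adjust if your checkpoint differs.
dummy = torch.rand(1, 3, 32, 128)

torch.onnx.export(model, dummy, 'parseq/test.onnx', opset_version=14,
                  input_names=['input'], output_names=['output'])

# Simplify the graph with onnxsim; without this step the TensorRT engine produced NaNs.
simplified, ok = simplify(onnx.load('parseq/test.onnx'))
assert ok, 'onnxsim simplification failed'
onnx.save(simplified, 'parseq/test.onnx')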
Environment
TensorRT Version : TensorRT-8.6.1.6
GPU Type : RTX 3060
Nvidia Driver Version : 531.79
CUDA Version : cuda-12.0
CUDNN Version : cudnn-8.9.1.23_cuda12
Operating System + Version : Windows 10
Python Version : 3.8
PyTorch Version : 1.13
ONNX opset : 14
Relevant Files
onnx model: test.onnx (https://drive.google.com/file/d/1CRXsD8Zk5Mo50JYCZytrAtBbFm2oOqvc/view?usp=sharing)
trtexec.exe --onnx=parseq/test.onnx --workspace=10000 --saveEngine=parseq/test_fp32.trs --verbose
trt engine fp32: test_fp32.trt (https://drive.google.com/file/d/17eecl4QrRrE1BiLqDE8HJT0wZCVm3BkB/view?usp=sharing)
trt engine fp32 log: test_fp32_log.txt (https://drive.google.com/file/d/1i9KkbKainaNIz5QQvolmScIu53DzFHHv/view?usp=sharing)
trtexec.exe --onnx=parseq/test.onnx --fp16 --workspace=10000 --saveEngine=parseq/test_fp16.trs --verbose
trt engine fp16: test_fp16.trt (https://drive.google.com/file/d/1CIzRZ-71a2hXZWnMNtWn7k2tuM3Pi6K_/view?usp=sharing)
trt engine fp16 log: test_fp16_log.txt (https://drive.google.com/file/d/15LOBtarM6RZiiyZaz66qt6Z8nu67JyrN/view?usp=sharing)
Steps To Reproduce
I wrote sample code to compare the similarity of the ONNX and TensorRT inference results. When I use real data, the mean similarity is 0.3; when I use random numbers, it is near 0.85.
sample code: https://drive.google.com/file/d/1dLo9iD3ZUPVuvU6LNFnwQSCjcLDTiKQr/view?usp=sharing
sample real data: https://drive.google.com/file/d/1VtQgOYw5ZYQSZmUOGyJ7xPKElC7caFMl/view?usp=sharing
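A minimal sketch of such an ONNX-vs-TensorRT comparison (a hypothetical re-implementation, not the shared script; it assumes polygraphy and onnxruntime are installed, and the input name 'input' and the 1x3x32x128 shape are placeholders):

import numpy as np
from polygraphy.backend.common import BytesFromPath
from polygraphy.backend.onnxrt import OnnxrtRunner, SessionFromOnnx
from polygraphy.backend.trt import EngineFromBytes, TrtRunner

def cosine_similarity(a, b):
    # Flatten both outputs and compute their cosine similarity.
    a = a.ravel().astype(np.float64)
    b = b.ravel().astype(np.float64)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

# Random input for a quick check; substitute a real preprocessed image to reproduce the 0.3 case.
feed = {'input': np.random.rand(1, 3, 32, 128).astype(np.float32)}

with OnnxrtRunner(SessionFromOnnx('parseq/test.onnx')) as onnx_runner, \
     TrtRunner(EngineFromBytes(BytesFromPath('parseq/test_fp32.trt'))) as trt_runner:
    onnx_out = list(onnx_runner.infer(feed).values())[0]
    trt_out = list(trt_runner.infer(feed).values())[0]

print('cosine similarity:', cosine_similarity(onnx_out, trt_out))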
I have the same problem with VitStr, which is based on the timm Vision Transformer.
VitStr:
Vision transformer:
""" Vision Transformer (ViT) in PyTorch
A PyTorch implement of Vision Transformers as described in:
'An Image Is Worth 16 x 16 Words: Transformers for Image Recognition at Scale'
- https://arxiv.org/abs/2010.11929
`How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers`
- https://arxiv.org/abs/2106.10270
`FlexiViT: One Model for All Patch Sizes`
- https://arxiv.org/abs/2212.08013
The official jax code is released and available at
* https://github.com/google-research/vision_transformer
* https://github.com/google-research/big_vision
Acknowledgments:
* The paper authors for releasing code and weights, thanks!
* I fixed my class token impl based on Phil Wang's https://github.com/lucidrains/vit-pytorch
This file has been truncated. show original
It seems that the problem is in the Vision Transformer part of the model.
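One way to check this hypothesis would be to export a plain timm ViT by itself and run the same ONNX-vs-TensorRT comparison on it (sketch below; vit_base_patch16_224 is just an example checkpoint, not the backbone used by parseq or VitStr):

import timm
import torch

# Export a standalone timm ViT to ONNX so the backbone can be tested in isolation.
vit = timm.create_model('vit_base_patch16_224', pretrained=True).eval()
dummy = torch.rand(1, 3, 224, 224)

torch.onnx.export(vit, dummy, 'vit_base_patch16_224.onnx', opset_version=14,
                  input_names=['input'], output_names=['output'])

If the mismatch reproduces on the bare backbone, that would support the Vision Transformer theory.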
Hi,
Please share the ONNX model and the script, if not shared already, so that we can assist you better.
Alongside, you can try a few things:
1) Validate your model with the snippet below:
check_model.py
import onnx

filename = "your_model.onnx"  # path to the ONNX model to validate
model = onnx.load(filename)
onnx.checker.check_model(model)
2) Try running your model with the trtexec command.
In case you are still facing the issue, please share the trtexec "--verbose" log for further debugging.
Thanks!
It is a bug in the 8.6.1 version of TensorRT, and it will be fixed in the next release.
(The same report was filed as a GitHub issue on the TensorRT repository, opened 15 Jul 2023 and triaged; it duplicates the description, environment, relevant files, and reproduction steps above.)
system
Closed July 31, 2023, 4:55am
This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.