Trtexec : Error[4]: [graphShapeAnalyzer.cpp::analyzeShapes::1294] Error Code 4: Miscellaneous

Description

Hello, I am using the yolov5s.onnx ONNX model to run inference with the trtexec CLI, but I am facing an error (see below) when changing the batch size.

Environment

TensorRT Version: trtexec command line interface
GPU Type: Jetson AGX Orin
Nvidia Driver Version:
CUDA Version: 11.4
CUDNN Version: 8.3.2.49
Operating System + Version: Ubuntu 20.04
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag): baremetal

Relevant Files

Yolov5s ONNX model :
D_yolov5s.onnx (28.0 MB)

Steps To Reproduce

When you view the model in Netron here, you can see that it has Nx3x640x640 as its input shape. The N means the batch size is dynamic. However, when I run the model with trtexec and change the batch size, I get an error.

COMMAND:

  • $ /usr/src/tensorrt/bin/trtexec --onnx=D_yolov5s.onnx --int8 --explicitBatch --shapes=images:2x3x640x640 --verbose

Here, for example, I set the batch size to 2, but you can also try 8, 16, 32, etc.
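As a side note, when the input really is dynamic, trtexec can also be given an explicit optimization profile instead of a single `--shapes` value. A command sketch (the shape range here is illustrative, not taken from this thread; adjust min/opt/max to the batch sizes you actually need):

```
/usr/src/tensorrt/bin/trtexec --onnx=D_yolov5s.onnx \
  --minShapes=images:1x3x640x640 \
  --optShapes=images:8x3x640x640 \
  --maxShapes=images:32x3x640x640 \
  --verbose
```

TensorRT then builds one engine that accepts any batch size in the [min, max] range, tuned for the opt shape.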

NOTE:

I tested with a batch size of 1 and it worked; only with a batch size higher than one does it give an error.

Thank you in advance.

Best regards,
Harry

Hi, please refer to the links below to perform inference in INT8.

Thanks!

Hello @NVES ,

Thank you for your reply. However, even when using FP32 with the command below:

/usr/src/tensorrt/bin/trtexec --onnx=D_yolov5s.onnx --explicitBatch --shapes=images:2x3x640x640 --verbose

you get the same error. Please try to reproduce it by running the command on your Jetson AGX Orin; you will see the problem clearly.

Thank you.

Best regards,
Harry

Hi,

It looks like the input node “images” does not have a dynamic shape (it is defined as a static input); that is why it works fine with batch size 1.
Please regenerate the ONNX model with a dynamic shape input. Alternatively, you can modify the ONNX model directly.

Thank you.

1 Like

Hello @spolisetty ,

Thank you for your answer. If you look at the model in Netron, you can see that I modified it to use dynamic shapes, so the input node “images” supports Nx3x640x640, where N is a dynamic batch size. So normally it should work.

Thank you.

Best regards,
Harry

Hi,

It’s because the Reshape op has a hard-coded shape [1, 3, 85, 20, 20], which should have been [-1, 3, 85, 20, 20].

Could you please let us know how you exported the ONNX model from PyTorch/TensorFlow? Did you use the dynamic_axes argument as in (optional) Exporting a Model from PyTorch to ONNX and Running it using ONNX Runtime — PyTorch Tutorials 1.12.1+cu102 documentation?

If you already did that, then you need to modify the Reshape in the PyTorch code from [1, 3, 85, 20, 20] to [-1, 3, 85, 20, 20].

Thank you.

1 Like

Hello @spolisetty ,

Thank you for your response. I used another method: I hard-coded the input shape to Nx3x640x640, which apparently is not the right way to do it.

So I will try two other methods:

  1. I will use this GitHub repo to export the ONNX model from PyTorch using the script export.py with the --dynamic flag; with this option the model is indeed dynamic when verified in Netron. However, I got other errors in trtexec; I will share the error log as soon as possible after running it on the Jetson AGX Orin.

  2. I will change the Reshape to [-1, 3, 85, 20, 20] as you suggested.

Thank you very much for your help, I will keep you updated.

Best regards,
Harry

Hello @spolisetty ,

This is my dynamic yolov5s ONNX model below:
yolov5s.onnx (27.6 MB)

When I run it with trtexec as before, I get this error:

I used the GitHub repo here and added the --dynamic option to get the ONNX model with dynamic shapes. I also verified the model in Netron; it does indeed have dynamic shapes, and you can verify this as well.

Thank you in advance.

Best regards,
Harry

Hi @Harry-S,

We couldn’t reproduce the same error. We could successfully build the TensorRT engine.

[07/01/2022-10:00:49] [I]
&&&& PASSED TensorRT.trtexec [TensorRT v8401] # /opt/tensorrt/bin/trtexec --onnx=yolov5s.onnx --shapes=images:2x3x640x640 --verbose

We recommend that you use the latest version, TensorRT 8.4 GA.

Thank you.

1 Like

Hello @spolisetty ,

Thank you very much for your reply.

I am indeed using TensorRT 8.4; see the photo below.

Hi,

We tried with TensorRT version 8.4.1.
Maybe you’re still using the 8.4 EA version; please upgrade your TensorRT version to 8.4 GA.
https://developer.nvidia.com/nvidia-tensorrt-8x-download

Also, please try increasing the workspace size, as some tactics need more workspace memory to run.
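A command sketch for raising the workspace pool (the 4096 MiB value is only an example; in TensorRT 8.4 the `--memPoolSize` flag supersedes the older `--workspace` flag):

```
/usr/src/tensorrt/bin/trtexec --onnx=yolov5s.onnx \
  --shapes=images:2x3x640x640 \
  --memPoolSize=workspace:4096
```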

Thank you.

1 Like

Hello @spolisetty ,

I updated TensorRT as you suggested and it worked; see the photo below:

However, I am now facing a new problem: CUDA is reported as not installed, see below:

But CUDA is indeed installed; see the output of nvcc -V below:

image

NOTE :

  • I also updated the system as suggested after installing it using the Debian package here, and finally ran these commands: $ sudo apt-get update && sudo apt-get install tensorrt libcudnn8
  • Do I need to install CUDA from here? I think it is a version mismatch between cuDNN, CUDA, and TensorRT, because everything was working fine before updating TensorRT to the latest version. I am using JetPack 5.0 DP; see below the JetPack version from the command $ sudo apt-cache show nvidia-jetpack

image

EDIT:

Apparently I am actually using JetPack 5.0.1:

Thank you in advance.

Best regards,
Harry

Hi,

This looks like a setup-related issue on the Jetson.
We recommend that you open a new post on the Jetson forum regarding this setup issue to get better help.

Thank you.

Okay, thank you, I will do that and put a link here so people can see it, because everything was working fine before updating trtexec.

Harry

EDIT:

Here is the link to the new topic: CUDA is not working after updating trtexec

1 Like