Inference on TensorRT engine with trtexec

Tungdil99 · March 22, 2024, 8:35am

Description

I’m using a publicly available computer vision model called UIU-Net. When I run the provided inference code on the pretrained model the authors share, I get sensible results. But after I convert this pretrained PyTorch model to TensorRT, the inference pipeline I want to build doesn’t work as expected and the results do not match at all.
To convert the PyTorch model to a TensorRT engine, I first convert it to an ONNX model, and that ONNX model works as expected too. However, converting this ONNX model to a TensorRT engine and running inference with “trtexec” does not give matching results.
You can find my scripts and steps to reproduce below. I suspect that the way I save the output of “trtexec” and then convert the resulting JSON to an image is faulty, but I’m open to any advice.
Finally, keep in mind that my question is about creating an inference pipeline rather than something specific to this model. I believe you can reproduce this with any CNN model on any image.

Thanks in advance

Environment

TensorRT Version: 8.5.2-1+cuda11.4
GPU Type: Jetson AGX Orin
Nvidia Driver Version: NVIDIA UNIX Open Kernel Module for aarch64 35.4.1
CUDA Version: 11.4
CUDNN Version: 8.6
Operating System + Version: 5.10.120-tegra - Ubuntu 20.04
Python Version (if applicable): 3.8.2
TensorFlow Version (if applicable):
PyTorch Version (if applicable): 2.1.0a0+41361538.nv23.06
Baremetal or Container (if container which image + tag):

Relevant Files

upload.zip (2.5 KB)
torch2onnx.py: Script I use for torch to onnx model conversion
infer-onnx.py: Run inference on the resultant onnx model for sanity check
img2dat.py: Converts .png image to binary .dat file for inference
json2img.py: Converts the output .json file back to .png image
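
For readers without access to upload.zip, here is a minimal sketch of what an img2dat-style conversion could look like. It is not the script from the archive. The function names `image_to_trt_input` and `write_dat` are hypothetical, and the preprocessing (scaling to [0, 1], NCHW layout, float32) is an assumption that has to match whatever torch2onnx.py and infer-onnx.py actually do. As far as I know, “trtexec --loadInputs” reads the file as raw bytes with no header, so the dtype, shape, and layout must match the engine's input binding exactly.

```python
import numpy as np


def image_to_trt_input(pixels: np.ndarray) -> np.ndarray:
    """Convert an HxW or HxWxC uint8 image to a 1xCxHxW float32 tensor.

    Assumption: the model expects values scaled to [0, 1] with no
    mean/std normalization. Adjust to match the ONNX inference script.
    """
    if pixels.ndim == 2:
        # Grayscale image: add a trailing channel axis.
        pixels = pixels[:, :, None]
    # HWC -> CHW, then scale 0..255 to 0..1.
    chw = np.transpose(pixels, (2, 0, 1)).astype(np.float32) / 255.0
    # Add the batch axis and make the memory layout contiguous.
    return np.ascontiguousarray(chw[None, ...])


def write_dat(tensor: np.ndarray, path: str) -> None:
    """Write the tensor as raw float32 bytes (no header)."""
    tensor.astype(np.float32).tofile(path)
```

A quick sanity check is that the file size equals N x C x H x W x 4 bytes; for a 1x3x512x512 float32 input that is 3,145,728 bytes.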

Steps To Reproduce

Download pretrained weights from here
Run torch2onnx.py
Download any .png image (8-bit, 512x512 preferred)
Run infer-onnx.py (Optional)
Run img2dat.py
Execute “trtexec --onnx=uiu-net.onnx --saveEngine=uiu-net-fp32.trt”
Execute “trtexec --loadEngine=uiu-net-fp32.trt --loadInputs=input.1:input_tensor.dat --exportOutput=frame1000.json”
Run json2img.py
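
The last step above can be sketched as follows. This is not the json2img.py from the archive. It assumes the file written by “--exportOutput” is a JSON list with one entry per output tensor, each holding a "name", a "dimensions" string such as "1x1x512x512", and a flat "values" list; please check this against your own file. The helper names `load_trtexec_output` and `to_uint8_mask` are hypothetical, and the min-max scaling is only one possible way to turn the values into pixels.

```python
import json

import numpy as np


def load_trtexec_output(path: str, index: int = 0):
    """Read one output tensor from a trtexec --exportOutput JSON file.

    Assumed layout: a list of {"name", "dimensions", "values"} entries,
    where "dimensions" is an "x"-separated string like "1x1x512x512".
    """
    with open(path) as f:
        entries = json.load(f)
    entry = entries[index]
    shape = tuple(int(d) for d in entry["dimensions"].split("x"))
    values = np.asarray(entry["values"], dtype=np.float32)
    # reshape raises if the value count does not match the dimensions.
    return entry["name"], values.reshape(shape)


def to_uint8_mask(tensor: np.ndarray) -> np.ndarray:
    """Squeeze to HxW and min-max scale to 0..255 (an assumed choice)."""
    plane = np.squeeze(tensor)
    if plane.ndim != 2:
        raise ValueError(f"expected a single-channel map, got {plane.shape}")
    lo, hi = float(plane.min()), float(plane.max())
    scale = (hi - lo) or 1.0  # avoid division by zero on constant maps
    return np.round((plane - lo) / scale * 255.0).astype(np.uint8)
```

If the engine has several outputs, the entry to pick is the one whose name matches the output used in the ONNX sanity check. Comparing that array against the ONNX Runtime result with np.allclose before writing any image helps separate an engine problem from a conversion problem.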

AakankshaS · March 30, 2024, 11:00am

I am trying to reproduce this on my end and will update you.
Thanks
