How to deal with conversion error from PyTorch to TensorRT

Hi, community. I converted my PyTorch model with a custom layer from PyTorch to TensorRT through torch2trt (GitHub - NVIDIA-AI-IOT/torch2trt: An easy to use PyTorch to TensorRT converter).
To do this, I wrote a custom plugin for TensorRT and a custom converter for torch2trt.
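Roughly, the converter registration follows the decorator pattern from the torch2trt README. In the sketch below, mymodel.MyCustomLayer and module.trt_plugin are placeholders for my actual layer and for however the plugin instance is created:

import tensorrt as trt
from torch2trt import tensorrt_converter

# Register a converter for the custom layer's forward method.
# 'mymodel.MyCustomLayer' and 'trt_plugin' are placeholders for my
# real module and for the TensorRT plugin instance it holds.
@tensorrt_converter('mymodel.MyCustomLayer.forward')
def convert_my_custom_layer(ctx):
    module = ctx.method_args[0]   # the MyCustomLayer instance
    input = ctx.method_args[1]    # its input tensor
    output = ctx.method_return    # the tensor PyTorch returned
    # add the custom TensorRT plugin to the network being built
    layer = ctx.network.add_plugin_v2(inputs=[input._trt], plugin=module.trt_plugin)
    # attach the TensorRT output so downstream converters can find it
    output._trt = layer.get_output(0)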

To keep it simple, I took LeNet, converted it to TensorRT, and then measured the error like this:

import torch
from torch2trt import torch2trt

# create some regular pytorch model...
model = LeNet().eval().cuda()

# create example data
x = torch.ones((1, 1, 32, 32)).cuda()

# convert to TensorRT, feeding sample data as input
model_trt = torch2trt(model, [x])

# run both models on the same input
y = model(x)
y_trt = model_trt(x)

# check the output against PyTorch
print(torch.max(torch.abs(y - y_trt)))

And the error is very small, around 4e-7.
After that I tried to convert LeNet with my custom layers, and the error grew to about 0.018.

But when I load the trained weights and measure the error, it becomes even higher:

import torch
from torch2trt import torch2trt

# load the trained weights
model_path = 'models/lenet5_mnist.pt'
lenet5_model.load_state_dict(torch.load(model_path))
lenet5_model.eval().cuda()

# create example data
x = torch.ones((1, 1, 32, 32)).cuda()

# convert to TensorRT, feeding sample data as input
model_trt = torch2trt(lenet5_model, [x])

# compare the outputs
y = lenet5_model(x)
y_trt = model_trt(x)
print(torch.max(torch.abs(y - y_trt)))

And the error is around 2.5:

tensor(2.5567, device='cuda:0', grad_fn=<MaxBackward1>)

I’ve also run inference on several images and confirmed that there is a significant drop in accuracy.
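Roughly, the accuracy check I ran looked like the sketch below (simplified; it assumes a standard torchvision MNIST test loader, with batch size 1 to match the input shape the TensorRT engine was built with, and reuses lenet5_model and model_trt from above):

import torch
from torchvision import datasets, transforms

# MNIST test images, resized to the 32x32 input LeNet-5 expects
transform = transforms.Compose([transforms.Resize(32), transforms.ToTensor()])
test_set = datasets.MNIST('data', train=False, download=True, transform=transform)
loader = torch.utils.data.DataLoader(test_set, batch_size=1)

correct_torch = correct_trt = total = 0
with torch.no_grad():
    for image, label in loader:
        image, label = image.cuda(), label.cuda()
        correct_torch += (lenet5_model(image).argmax(1) == label).sum().item()
        correct_trt += (model_trt(image).argmax(1) == label).sum().item()
        total += label.size(0)

print('pytorch accuracy: ', correct_torch / total)
print('tensorrt accuracy:', correct_trt / total)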

I’m trying to debug this and wanted to print the architecture.
For the PyTorch model I can simply write:

print(lenet5_model)

but when I try the same on the wrapped TensorRT model:

print(model_trt)

it outputs only:

TRTModule()

I also tried to check the model weights; for PyTorch this works fine:

print(lenet5_model.state_dict()['_body.0.weight'])

But the TensorRT model outputs something unreadable:

print(model_trt.state_dict())

So my question is: how can I inspect the model graph, weights, etc. in TensorRT?
Or maybe there are some tips on how to debug my conversion?

Hi,
Please share the ONNX model and the script, if not shared already, so that we can assist you better.
Alongside, you can try a few things:

  1. validating your model with the snippet below (it takes the ONNX file path as a command-line argument)

check_model.py

import sys
import onnx

# take the ONNX file path from the command line
filename = sys.argv[1]
model = onnx.load(filename)
# raises an exception if the model is not well-formed
onnx.checker.check_model(model)
  2. running your model with the trtexec command:
https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource/trtexec
In case you are still facing the issue, please share the trtexec --verbose log for further debugging.
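For example, assuming the exported model is saved as model.onnx (the filename here is just a placeholder), the two checks above can be run as:

python check_model.py model.onnx
trtexec --onnx=model.onnx --verbose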
Thanks!

Thanks for sharing. However, I’m converting the model directly from PyTorch to TensorRT using torch2trt, not through ONNX.

Hi,

We recommend that you post your concern on Issues · NVIDIA-AI-IOT/torch2trt · GitHub to get better help.

Thank you.