Failed to create TensorRT engine from QAT ONNX model

Description

I am debugging QAT with pytorch_quantization (TensorRT/tools/pytorch-quantization at main · NVIDIA/TensorRT · GitHub).
The ONNX model with Q/DQ nodes can be exported from the .pth checkpoint successfully, but converting this simple ONNX model to a TensorRT engine fails.
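For reference, the export follows the standard pytorch_quantization flow, roughly the sketch below; the model class MyQATModel, the checkpoint path, and the input shape are placeholders, not the exact network from the attached files.

import torch
from pytorch_quantization import nn as quant_nn
from pytorch_quantization import quant_modules

# Patch torch.nn layers with quantized counterparts before building the model,
# so the network carries TensorQuantizer (fake-quant) modules for QAT.
quant_modules.initialize()

model = MyQATModel()                          # placeholder for the real network
model.load_state_dict(torch.load("0.pth"))    # QAT-trained checkpoint
model.eval()

# Export the fake-quant ops as ONNX QuantizeLinear/DequantizeLinear (Q/DQ) pairs.
quant_nn.TensorQuantizer.use_fb_fake_quant = True
dummy_input = torch.randn(1, 3, 224, 224)     # placeholder input shape
torch.onnx.export(model, dummy_input, "0.pth.onnx", opset_version=13)

The trtexec build of the exported model then fails with: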

[01/13/2023-08:20:26] [V] [TRT] Removing QuantizeLinear_70
[01/13/2023-08:20:26] [V] [TRT] Removing DequantizeLinear_41
[01/13/2023-08:20:26] [V] [TRT] Removing DequantizeLinear_44
[01/13/2023-08:20:26] [V] [TRT] ConstWeightsFusion: Fusing conv23.weight + QuantizeLinear_43 with Conv_45
[01/13/2023-08:20:26] [E] Error[2]: [graphOptimizer.cpp::fusePattern::1777] Error Code 2: Internal Error (Assertion matchPattern(context, first) && matchBackend(first) failed. )
[01/13/2023-08:20:26] [E] Error[2]: [builder.cpp::buildSerializedNetwork::636] Error Code 2: Internal Error (Assertion engine != nullptr failed. )
[01/13/2023-08:20:26] [E] Engine could not be created from network
[01/13/2023-08:20:26] [E] Building engine failed
[01/13/2023-08:20:26] [E] Failed to create engine from model or file.
[01/13/2023-08:20:26] [E] Engine set up failed

Environment

I am testing TensorRT in Docker.

Platform : Orin
Jetpack Version : 5.0.2-b231
TensorRT Version : 8.4.1
CUDA Version : 11.4
CUDNN Version : 8.4.1
Operating System + Version : Ubuntu 20.04
Python Version (if applicable) : 3.8.10
PyTorch Version (if applicable): 1.10
Baremetal or Container (if container which image + tag): dustynv/ros:noetic-ros-base-l4t-r35.1.0 (GitHub - dusty-nv/jetson-containers: Machine Learning Containers for NVIDIA Jetson and JetPack-L4T)

Relevant Files

0.pth.onnx (173.1 KB)
0.pth.onnx.log (98.0 KB)

Steps To Reproduce

trtexec --verbose --nvtxMode=verbose --buildOnly --workspace=8192 --onnx=0.pth.onnx --saveEngine=0.pth.onnx.engine --timingCacheFile=./timing.cache --profilingVerbosity=detailed --fp16 --int8

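For scripting the same build without trtexec, a rough Python-API equivalent is sketched below; it mirrors only the ONNX parse and the FP16/INT8 flags, and omits the timing cache and profiling options. Since it runs the same builder underneath, it should hit the same assertion.

import tensorrt as trt

ONNX_PATH = "0.pth.onnx"
ENGINE_PATH = "0.pth.onnx.engine"

logger = trt.Logger(trt.Logger.VERBOSE)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)

# Parse the Q/DQ ONNX model.
with open(ONNX_PATH, "rb") as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise RuntimeError("Failed to parse " + ONNX_PATH)

config = builder.create_builder_config()
config.max_workspace_size = 8 << 30           # rough equivalent of --workspace=8192 (MiB)
config.set_flag(trt.BuilderFlag.FP16)         # --fp16
config.set_flag(trt.BuilderFlag.INT8)         # --int8; scales come from the Q/DQ nodes

serialized_engine = builder.build_serialized_network(network, config)
if serialized_engine is None:
    raise RuntimeError("Engine build failed")
with open(ENGINE_PATH, "wb") as f:
    f.write(serialized_engine)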
Another topic reporting the same error: https://forums.developer.nvidia.com/t/error-code-2-internal-error-assertion-matchpattern-context-first-matchbackend-first-failed/223247/7

Hi,

We could build the TensorRT engine successfully on Tesla V100 GPUs.

[01/13/2023-12:02:23] [I] [TRT] [MemUsageChange] TensorRT-managed allocation in engine deserialization: CPU +0, GPU +0, now: CPU 0, GPU 0 (MiB)
[01/13/2023-12:02:23] [I] Engine deserialized in 0.0136184 sec.
[01/13/2023-12:02:23] [I] Skipped inference phase since --buildOnly is added.
&&&& PASSED TensorRT.trtexec [TensorRT v8501] # trtexec --verbose --nvtxMode=verbose --buildOnly --workspace=8192 --onnx=0.pth.onnx --saveEngine=0.pth.onnx.engine --timingCacheFile=./timing.cache --profilingVerbosity=detailed --fp16 --int8

Please try the latest TensorRT version, 8.5.2, and let us know if you still face this issue. We are moving this post to the Jetson AGX Orin forum in case you need further help.

Thank you.

@spolisetty
It works with TensorRT 8.5.1 (I built the TensorRT engine using the Docker image nvcr.io/nvidia/tensorrt:22.12-py3).

Can you explain more about this error?

Another question:
How do I upgrade TensorRT from 8.4.1 to 8.5.1 (or 8.5.2) on Orin?

Thanks!

