I ran into an issue while building a TensorRT engine from an ONNX PTQ model on the NVIDIA Orin platform. The model was quantized with NVIDIA's onnx_ptq tool.
During engine creation, trtexec reported the following error:
[02/27/2025-14:24:37] [V] [TRT] Removing /stages/stages.3/downsample/attn/q/Add_output_0_QuantizeLinear
[02/27/2025-14:24:37] [V] [TRT] Removing /stages/stages.2/blocks/blocks.14/Add_1_output_0_DequantizeLinear_2
[02/27/2025-14:24:37] [V] [TRT] Removing stages.3.downsample.attn.q.local.weight_DequantizeLinear
[02/27/2025-14:24:37] [V] [TRT] ConstWeightsFusion: Fusing stages.3.downsample.attn.q.local.weight + stages.3.downsample.attn.q.local.weight_QuantizeLinear with /stages/stages.3/downsample/attn/q/local/Conv
[02/27/2025-14:24:37] [E] Error[2]: [graphOptimizer.cpp::fusePattern::1909] Error Code 2: Internal Error (Assertion matchPattern(context, first) && matchBackend(context, first) failed. )
[02/27/2025-14:24:37] [E] Engine could not be created from network
[02/27/2025-14:24:37] [E] Building engine failed
[02/27/2025-14:24:37] [E] Failed to create engine from model or file.
[02/27/2025-14:24:37] [E] Engine set up failed
This issue looks similar to this post, but I am running a much newer version of TensorRT.
Steps to Reproduce
efficientformerv2_l.ptq.zip (93.4 MB)
/usr/src/tensorrt/bin/trtexec --verbose --onnx=efficientformerv2_l.ptq.onnx --saveEngine=efficientformerv2_l.engine --timingCacheFile=timing.cache --fp16 --int8
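To narrow the failure down, I also listed the QuantizeLinear/DequantizeLinear nodes around the layer named in the fusion error. This is a minimal sketch, assuming the `onnx` Python package is available for loading the model; the helper itself only walks the graph's node list.

```python
def find_qdq_nodes(graph, name_fragment):
    """Return Quantize/DequantizeLinear nodes whose name contains name_fragment.

    `graph` is any object with a `.node` sequence whose elements expose
    `.op_type` and `.name` (e.g. an onnx GraphProto).
    """
    return [
        n for n in graph.node
        if n.op_type in ("QuantizeLinear", "DequantizeLinear")
        and name_fragment in n.name
    ]

# Usage with the onnx package on the repro model above, inspecting the
# subgraph that the fusion error points at (stages.3.downsample.attn.q):
#   import onnx
#   model = onnx.load("efficientformerv2_l.ptq.onnx")
#   for node in find_qdq_nodes(model.graph, "stages.3/downsample/attn/q"):
#       print(node.op_type, node.name)
```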
Thank you in advance for your support!
Environment
Platform : Orin
JetPack Version : 6.2+b77
TensorRT Version : 10.7
CUDA Version : 12.6.85
CUDNN Version : 9.3
Operating System + Version : Ubuntu 22.04 Jammy Jellyfish
Baremetal or Container (if container which image + tag): baremetal