Trtexec convert dla model fail

parkerpang1997 · September 23, 2024, 12:53pm

Jetson Agx Orin 32G
JetPack 6.0
TensorRT 8.6.2.2

I use trtexec tool to convert onnx model to dla engine. Here is the input of my model:

The command is

/usr/src/tensorrt/bin/trtexec --onnx=model_fp_input/at_simplified_noqdq.onnx --calib=model_fp_input/qat_simplified_precision_config_calib.cache --useDLACore=0 --int8 --fp16 --saveEngine=model_fp_input/qat_simplified_standalone.dla --precisionConstraints=prefer --layerPrecisions=/normalize_input/Conv:fp16 --inputIOFormats=fp16:dla_hwc4 --outputIOFormats=fp16:chw16 --buildDLAStandalone

It returns fail with info as follow:

I want to know the reason why it failed and how should I modified my model if I want to using fp16:dla_hwc4 as model input since I can only offer fp16 and nhw4 data in my project and I don’t want to use preprocessing outside the model. Besides, uint8 and nhw4 input data is also available, but I think it can’t be passed to dla directly.

I have tried to remove the red box and using int8:dla_hwc4 input and trtexec worked successfully. But this model was not suitable for my project.

AastaLLL · September 24, 2024, 3:40am

Hi,

DLA doesn’t support quantization layers. Do you need it in the model?
Usually, users adopt the PTQ strategy on Jetson rather than QAT.

Below is the DLA support matrix for your reference:

Thanks.

parkerpang1997 · September 25, 2024, 2:11am

Hi,

The Onnx model with QDQ node is translalted to PTQ model with calibration cache using qdq_translator tools.

Here is the attributes of the first convolution layer and input shape is 8x3x352x640 (NCHW). I want to use fp16:dla_hwc4 input format, pass input to first fp16 conv layer and then quantize to int8. Thus the input for dla engine can be 8x352x640x4 (fp16).

I think my model and the config meet the input format requirements, but conversion failed.
Could you help me analyze the reasons for this failure?

parkerpang1997 · September 25, 2024, 3:05am

Is the group for the first convolutional layer required to be 1?

AastaLLL · September 25, 2024, 9:08am

Hi,

Does the model work when adding the GPU fallback?
Based on the log, it requires a reformat layer to convert the data type which usually is done by GPU.

Thanks.

system · October 23, 2024, 4:55am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Trtexec failed to generate engine (Internal Error) with DLA Jetson Orin NX tensorrt , nvbugs , dla	7	927	April 8, 2024
Error when use DLA with yoloV4 tensorRT Jetson AGX Xavier tensorrt , dla	7	905	May 10, 2023
Observing different output when running layer on DLA vs running on GPU Jetson AGX Xavier dla	5	1055	June 25, 2021
Resize incompatibility when generating a full TensorRT engine for DLA Jetson Orin NX tensorrt , dla	8	686	February 23, 2023
Trtexec problem Jetson AGX Xavier tensorrt , jetson-inference	6	1684	September 27, 2021
Cannot build a TensorRT engine for DLA because Constant_output_0 is not supported in DLA Jetson AGX Orin tensorrt , dla	8	202	July 23, 2024
Jetson Orin: All layers pushed to GPU, zero layers on DLA Jetson AGX Orin tensorrt , dla	7	1023	April 26, 2023
Running yolox with relu activation on dla gives me incorrect result Jetson Xavier NX dla	6	868	May 30, 2022
Failure in conversion for DLA TensorRT	3	479	July 21, 2021
Pre-trained model in DLA/TENSORCORES Jetson AGX Orin dla , kb	5	806	May 17, 2023

Trtexec convert dla model fail

Related topics