Hi, I am a beginner in this field, so I need some help. I have an NVIDIA AGX Orin 64 GB Developer Kit and want to implement real-time facial recognition on it. I read the documentation (Developer Guide) but have some queries regarding model conversion:
1. Can I convert any model from .onnx to a TensorRT engine (e.g. ArcFace, ResNet100, etc.) using trtexec?
2. If TensorRT is utilizing the GPU properly, is the above conversion the most efficient way to run the model?
3. Is passing --useDLACore=0 while building the TensorRT engine enough to utilize the DLA, or do I have to configure anything else when running these models in DeepStream?
1. In general, yes, but if your model contains a special layer, TensorRT might not have a corresponding implementation for it.
2. Yes, it is efficient in both performance and memory.
3. You can place the model on the DLA with --useDLACore=[ID] when inferring with trtexec.
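As a minimal sketch, a trtexec build command for this use case could look like the following. The file names are placeholders, and the flags shown (--onnx, --saveEngine, --fp16, --useDLACore, --allowGPUFallback) are standard trtexec options; adjust paths and precision to your setup:

```shell
# Convert an ONNX model to a TensorRT engine targeting DLA core 0.
# --allowGPUFallback lets layers the DLA cannot run fall back to the GPU.
trtexec --onnx=arcface_r100_v1.onnx \
        --saveEngine=arcface_r100_v1.engine \
        --fp16 \
        --useDLACore=0 \
        --allowGPUFallback
```

Note that the DLA supports FP16 and INT8 only, so a precision flag such as --fp16 is required when targeting it.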
For DeepStream, please see the topic below on modifying the config to run on DLA.
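For reference, a sketch of the relevant nvinfer configuration: in DeepStream, DLA usage is selected in the [property] group of the nvinfer config file via the enable-dla and use-dla-core keys (file paths here are placeholders):

```shell
# Excerpt from an nvinfer config file (config_infer_primary.txt is a placeholder name)
[property]
model-engine-file=arcface_r100_v1.engine
enable-dla=1
use-dla-core=0
network-mode=2
```

The engine should be built for the same DLA core that the config selects; otherwise nvinfer will rebuild it at startup.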
Thanks a lot @AastaLLL! How do I verify whether the .engine file I generated is actually using the DLA?
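One way to check, sketched below, is to rebuild (or load) the engine with trtexec's --verbose flag: the build log reports which layers are placed on the DLA and which fall back to the GPU. The grep pattern is only an assumption about the log wording and may need adjusting for your TensorRT version:

```shell
# Rebuild with verbose logging and inspect layer placement.
trtexec --onnx=arcface_r100_v1.onnx \
        --fp16 --useDLACore=0 --allowGPUFallback \
        --verbose 2>&1 | grep -i "dla"
```

You can also watch DLA activity at runtime with tegrastats, which reports DLA utilization on Jetson devices while inference is running.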
The execution was successful, and I passed --useDLACore=0 --allowGPUFallback as arguments when I converted my ONNX file to an engine. Just for your reference, I am using arcface_r100_v1.onnx and converting that to an engine. I later plan to use it in DeepStream.