Running inference in Jetson Orin NX with TensorRT

slozano · July 9, 2025, 3:10pm

Hello,
I am testing the performance of TensorRT in a Jetson OrinNX. To do so, I created a small program to run the inference of a neural network. Before running my program, I export my onnx model to engine.
Reading the documentation, I see that the inference can be run in the Deep Learning Accelerator (DLA) cores. I would expect the inference to run faster in DLA core than in GPU, but that is not my case.
Is my assumption wrong? Could it be because of something related to my Neural Network? Am I missing something related to TensorRT?

In particular, when I run the inference in GPU, the elapsed time is 13 ms but in DLA cores the elapsed time is 30 ms

AastaLLL · July 10, 2025, 5:31am

Hi,

Please find the explanation in our document below:
https://docs.nvidia.com/deeplearning/tensorrt/archives/tensorrt-1030/developer-guide/index.html#troubleshooting

Q: Why does my network run slower when using DLA than without DLA?

A: DLA was designed to maximize energy efficiency. Depending on the features supported by DLA and the features supported by the GPU, either implementation can be more performant. Your chosen implementation depends on your latency or throughput requirements and power budget. Since all DLA engines are independent of the GPU and each other, you could also use both implementations to increase the throughput of your network further.

Thanks.

Topic		Replies	Views
Jetson Orin AGX DLA does't works normal, infer speed is lower than without DLA Jetson AGX Orin dla	6	263	April 24, 2025
Getting less throughput while enabling DLAs on Jetson AGX Orin Jetson AGX Orin dla	5	881	February 23, 2023
DLA-v2 is slower than DLA-v1 Jetson AGX Orin tensorrt , jetson-inference	8	2888	July 6, 2022
Why is the inference speed of DLA on agx orin much slower than that without DLA? TensorRT dla	1	101	March 28, 2025
Big difference between using DLA core and not using DLA core Jetson Xavier NX tensorrt , dla	4	3197	October 18, 2021
How to improve performance of Jetson Orin NX Jetson Orin NX dla	2	934	April 18, 2025
Compute time in DLA slower than expected Jetson AGX Orin dla	5	1091	July 28, 2023
Why yolox inference time with DLA is longer than without DLA ，81 ms vs 8 ms? Jetson AGX Orin dla	5	696	June 9, 2023
DLA makes inference much slower TensorRT	0	571	December 23, 2019
How to use DLA correctly? Jetson Orin NX dla , gpu	6	855	March 10, 2025

Running inference in Jetson Orin NX with TensorRT

Related topics