Hi there,
I am working on deploying my segmentation model on the Jetson Orin NX and the Jetson Orin Nano, so I compared the performance of the two devices.
I found some interesting points:
- The Orin Nano runs at a higher GPU frequency than the Orin NX in 25W mode, so the Orin Nano 8GB is actually faster than the Orin NX 16GB at inference (lower latency and higher throughput). However, for my case, 8GB of memory is not enough.
- The Orin NX does have a DLA. According to the spec, the DLA's role is to relieve the burden on the GPU by allowing certain layers of the model to run on the DLA, and it offers high power efficiency. However, after moving layers to the DLA, although the GPU load is reduced, inference is slower than running everything on the GPU.
This creates an awkward situation. If I need a 16GB device, my only option is the Orin NX, but its inference performance is inferior to the Orin Nano's, and even though it has a DLA, inference on the DLA is slower than on the GPU.
Do you have any suggestions on how to configure the Orin NX to optimize throughput with my model? (I have already converted it to TensorRT engines, both with and without DLA, and already applied reduced-precision optimization.)
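For context, the engines were built along these lines with `trtexec` (a sketch; `model.onnx` and the engine filenames are placeholder paths, and FP16 is shown as one example of reduced precision):

```shell
# Build a GPU-only FP16 engine (model.onnx is a placeholder path)
trtexec --onnx=model.onnx --fp16 --saveEngine=model_gpu.engine

# Build a DLA engine on core 0, letting unsupported layers fall back to the GPU
trtexec --onnx=model.onnx --fp16 --useDLACore=0 --allowGPUFallback \
        --saveEngine=model_dla.engine
```

Note that every fallback to the GPU introduces a DLA-GPU transition, which may explain part of the slowdown I am seeing.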
Or is there another device that suits my case? What I want is the best throughput with 16GB of memory.
By the way, I noticed that I can switch the power mode to MAXN, but is that a best practice? I have also noticed that it can make the system unstable.
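These are the commands I am referring to (a sketch; the MAXN mode ID varies by module, so the mode number below is an assumption to check against your device's mode table):

```shell
# Query the current power mode
sudo nvpmodel -q

# Switch to MAXN (mode 0 on many Jetson modules; verify in /etc/nvpmodel.conf)
sudo nvpmodel -m 0

# Lock clocks to their maximum for the current power mode
sudo jetson_clocks
```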
Thanks.