TensorRT model inference running fully on DLA is slow due to abnormally long cudaEventSynchronize time
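
For context, the latency described in the title is typically measured with CUDA events recorded around the asynchronous enqueue call, with the wait showing up in `cudaEventSynchronize`. Below is a minimal, hedged sketch of that measurement pattern (it assumes a TensorRT `IExecutionContext` built for DLA and a dedicated stream; it is an illustration of the timing technique, not the exact code from this thread):

```cpp
// Sketch: timing an asynchronous TensorRT enqueue with CUDA events.
// Assumes bindings/tensor addresses have already been set on the context.
#include <cuda_runtime.h>
#include <NvInfer.h>
#include <cstdio>

void timeInference(nvinfer1::IExecutionContext* context, cudaStream_t stream) {
    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    cudaEventRecord(start, stream);   // mark the stream before the enqueue
    context->enqueueV3(stream);       // async launch of the DLA/GPU work (TensorRT 8.5+)
    cudaEventRecord(stop, stream);    // mark the stream after the enqueue

    cudaEventSynchronize(stop);       // blocks until the work completes; the long
                                      // wait reported in this thread is spent here
    float ms = 0.f;
    cudaEventElapsedTime(&ms, start, stop);
    printf("inference latency: %.3f ms\n", ms);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
}
```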

Let’s follow up on the new sparsity issue in the separate topic you created:

Thanks