Model running on DLA with TensoRT(8.4.0) is slower than TensorRT(8.3.0)

Constantineman · June 23, 2022, 9:43am

Hi,

I install JetPack 5.0.1 to Orin, the version of TensorRT is 8.4.0. I test model on DLA with gpu fallback, found that the latency(fp16) is significantly slower than TensorRT 8.3.0. The difference is very large, about two to three times。

A test model onnx file.
model.onnx (10.0 MB)

FP16 results comparison(trt 8.4.0 vs 8.3.0):
31ms vs 13ms
But int8 is 3ms under trt 8.4.0, seems reasonable.

I want to know why the version upgrade brings such a big difference in latency? Or the upgrade brings bugs?
What is the theoretical ratio of TOPS between int8 and fp16 on DLA? Is the results difference (fp16 vs int8) reasonable?

System details:
Machine: Jetson AGX Orin 64GB
SDK: JetPack 5.0.1 Developer Preview
Power mode: MAXN

cmd: trtexec --onnx=model.onnx --useDLACore=0 --allowGPUFallback --fp16

SivaRamaKrishnaNV · June 24, 2022, 3:43am

Dear @Constantineman,
could you share the complete trtexec logs of both TRT 8.3 and TRT 8.4

Splendor027 · June 24, 2022, 4:09pm

@Constantineman

How did you install TensorRT 8.3.0 to AGX Orin? I guess there is publicly available released version of 8.3.0 for AGX Orin? Can you specify the JetPack version or any installation method?

SivaRamaKrishnaNV · July 28, 2022, 5:24am

There is no update from you for a period, assuming this is not an issue any more.
Hence we are closing this topic. If need further support, please open a new one.
Thanks

Dear @Constantineman,
Could you provide any update on the ask?

system · August 24, 2022, 1:12am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
DLA-v2 is slower than DLA-v1 Jetson AGX Orin tensorrt , jetson-inference	8	2483	July 6, 2022
Keys to optimization a network on AGX Orin DLA for latency Jetson AGX Orin tensorrt , dla	2	827	October 6, 2023
TensorRT 8.6 Performance Issue in AGX Orin 32Gb Jetson AGX Orin tensorrt	9	425	February 27, 2024
Compute time in DLA slower than expected Jetson AGX Orin dla	5	908	July 28, 2023
GeMM performance on Orin DLA Jetson AGX Orin tensorrt , cuda , jetson-inference	10	880	February 21, 2024
TFLOPS(FP16) about DLA (Deep Learning Accelerator) on Jetson Orin NX Jetson AGX Orin dla , kb	4	1712	April 13, 2023
TRT engine successful built on JetPack 5.0.1(trt 8.4.1) but not on JetPack 5.1.2(TensorRT 8.5.2) Jetson Xavier NX tensorrt , dla	13	880	September 25, 2023
Getting less throughput while enabling DLAs on Jetson AGX Orin Jetson AGX Orin dla	5	758	February 23, 2023
Is there a plan to support DLA on the next TensorRT version? Jetson AGX Orin tensorrt , nvbugs , dla , tensorrt-model-optimizer	5	139	December 31, 2024
Orin NX 10% slower than Xavier AGX Jetson Orin NX performance	4	75	January 2, 2025

Model running on DLA with TensoRT(8.4.0) is slower than TensorRT(8.3.0)

Related topics