Run GPU and DLAs concurrently


I’m using JetPack 4.3 and TensorRT 6.0.

I am trying to run three networks on the Xavier AGX: the largest runs on the GPU and the other two on DLA0 and DLA1. Inference runs in three threads, one per hardware unit.

However, the GPU and the DLAs appear to run serially rather than concurrently. (see attached profiler screenshot)

I used trtexec to generate the engines, and the DLA engines were built without GPUFallback. All layers that are supposed to run on the DLAs are supported.
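For reference, the DLA engines can be built with trtexec roughly as follows (model filename is a placeholder; DLA requires FP16 or INT8 precision, and omitting --allowGPUFallback makes the build fail instead of silently falling back to the GPU):

```shell
# Build an engine pinned to DLA core 0, FP16, no GPU fallback (placeholder model name)
trtexec --onnx=model_dla0.onnx --useDLACore=0 --fp16 --saveEngine=model_dla0.engine

# Same for DLA core 1
trtexec --onnx=model_dla1.onnx --useDLACore=1 --fp16 --saveEngine=model_dla1.engine
```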

I used nvvp for profiling.

Thanks in advance.


Our profiler doesn’t support DLA profiling yet.
So you can only see the time slots where the DLA thread uses the GPU for reformatting (data transfer).
The real inference part is missing from the timeline.



I know that the profiler doesn’t profile the DLAs.
But while the DLAs are executing, the GPU is idle, and GPUFallback is disabled.

If I synchronize the DLAs and the GPU, the total inference duration is the same.


I solved my problem.

For me, the solution was to start the DLA inference earlier, and I also created an additional thread for the GPU inference.
I now start the DLA inference right after the GPU inference starts.
With this change, the overall execution time is reduced.
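The pattern above can be sketched with plain threads. This is a minimal illustration, not TensorRT code: run_inference stands in for enqueueing on an execution context and synchronizing its stream, and the sleep durations are placeholder latencies. The key point is that all three threads are started back-to-back, so no device waits for another to finish:

```python
import threading
import time

def run_inference(name, duration, log, lock):
    """Placeholder for context.execute_async(...) + stream synchronize.
    'duration' stands in for the real inference latency on that device."""
    start = time.perf_counter()
    time.sleep(duration)  # simulated inference work
    with lock:
        log.append((name, start, time.perf_counter()))

def run_all():
    log, lock = [], threading.Lock()
    # One thread per hardware unit: GPU, DLA0, DLA1 (durations are made up).
    threads = [
        threading.Thread(target=run_inference, args=("gpu", 0.20, log, lock)),
        threading.Thread(target=run_inference, args=("dla0", 0.10, log, lock)),
        threading.Thread(target=run_inference, args=("dla1", 0.10, log, lock)),
    ]
    # Start the GPU thread, then kick off the DLA threads immediately --
    # do not wait for the GPU inference to complete first.
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return log

if __name__ == "__main__":
    t0 = time.perf_counter()
    results = run_all()
    # Wall time should be close to the slowest device (~0.2 s),
    # not the sum of all three (~0.4 s), since the work overlaps.
    print(f"finished {len(results)} inferences in {time.perf_counter() - t0:.2f} s")
```

With real TensorRT engines, each thread would own its own execution context and CUDA stream, and synchronization (if any) would happen only after all three have been enqueued.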

Maybe this could help someone in the future.