The issue of GPU usage in tensorrt dla inference models

haihua.wei · February 28, 2023, 6:15am

Please provide the following info (tick the boxes after creating this topic):
Software Version
DRIVE OS 6.0.5
DRIVE OS 6.0.4 (rev. 1)
[*] DRIVE OS 6.0.4 SDK
other

Target Operating System
[*] Linux
QNX
other

Hardware Platform
DRIVE AGX Orin Developer Kit (940-63710-0010-D00)
DRIVE AGX Orin Developer Kit (940-63710-0010-C00)
DRIVE AGX Orin Developer Kit (not sure its number)
[*] other

SDK Manager Version
1.9.1.10844
[*] other

Host Machine Version
native Ubuntu Linux 20.04 Host installed with SDK Manager
native Ubuntu Linux 20.04 Host installed with DRIVE OS Docker Containers
native Ubuntu Linux 18.04 Host installed with DRIVE OS Docker Containers
[*] other

When I ran our deep learning model using tensorrt’s DLA0 and DLA1, it was clear that only a few operators were running on the GPU, and we found that GR3D_FREQ 40% when we tested with tegrastats.

However, using Nsight Systems analysis, we found that GPU resources take up very little resources and time, as shown in the figure below.

What is this phenomenon and how is tegrastats’ GPU share calculated?

SivaRamaKrishnaNV · February 28, 2023, 7:44am

Dear @haihua.wei,
Do you see any difference with changing interval parameter in tegrastats?
Nsight here shows timeline view of application execution. Tegrastats shows the how much GPU is in use when it is sampled at that time. The display output gets refreshed in certain interval time. It does not indicate if the GPU is use constantly.

haihua.wei · February 28, 2023, 11:14am

We haven’t tried tegrastats yet --interval, let’s try it and see what happens

haihua.wei · February 28, 2023, 11:20am

tegrastats --interval,tried it and still had the same result

haihua.wei · March 8, 2023, 7:52am

Are there other tools to visualize GPU usage, memory bandwidth usage, and MAC utilization statistics?

SivaRamaKrishnaNV · March 8, 2023, 9:02am

Dear @haihua.wei,
Did you check NVIDIA Nsight compute tool? It can be used to identify issues at CUDA kernel and get guidance. Also it track of various metrics. Please see Nsight Compute :: Nsight Compute Documentation as well.

Let me check internally on tegrastats behavior and update you.

SivaRamaKrishnaNV · March 8, 2023, 9:47am

Dear @haihua.wei,
how is tegrastats’ GPU share calculated?

GPU utilization here indicates how many sampled cycles are active during a period. so even GPU utilization shows 99%, it doesn’t mean GPU has achieved its maximum computing power. I hope it clarifies.

haihua.wei · March 9, 2023, 8:48am

Where can I download the Drive OS version of Nsight Compute? Can it be used to monitor DLA?
@SivaRamaKrishnaNV

SivaRamaKrishnaNV · March 9, 2023, 11:17am

Dear @haihua.wei,
Do you see /usr/local/cuda/bin/ncu-ui?
For DLA trace we need to nsys.

haihua.wei · March 10, 2023, 2:44am

Thanks, we found it.
@SivaRamaKrishnaNV

system · March 31, 2023, 9:12am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
GPU Usage DRIVE AGX Orin General driveos	4	346	April 4, 2024
Understanding tegrastats output DRIVE AGX Orin General driveos	1	11	November 21, 2024
How to check GPU Memory Utilization DRIVE AGX Orin General driveos-cuda	3	1692	January 18, 2024
GPU is getting utilized 99 % with face detection code using caffe model DRIVE AGX Orin General driveos-dl	6	35	November 18, 2024
There are GPU usage even if the model all ran on DLA DRIVE AGX Orin General driveos-dl	10	560	August 11, 2023
Installing tegrastats GUI DRIVE AGX Orin General driveos	18	222	August 8, 2024
How to obtain GPU memory information on NVIDIA DRIVE AGX Orin DEV Kit devices with DRIVE OS 6.0.8? DRIVE AGX Orin General driveos	5	478	February 9, 2024
How to use tegrastats to track gpu usage on drive orin DRIVE AGX Orin General drive-misc	7	490	June 10, 2024
When GPU and DLA are used at the same time, the time consumption increases with each other DRIVE AGX Orin General dla , driveos-dl	10	816	March 9, 2023
0% GPU usage from tegrastats in DRIVE OS 5.0.10.3L DPX2 General	9	1510	October 12, 2021

The issue of GPU usage in tensorrt dla inference models

Related topics