Question: NVML utilization

sopia0821 · July 4, 2024, 12:40pm

I have few questions about nvml’s utilization.

I’m currently working with nvmlDeviceGetProcessUtilization, nvmlDeviceGetUtilizationRates APIs.

nvidia-smi -l 1 & nvidia-smi pmon -d 1 -o T
By this command, I wanted to get device utilization and process utilization during same sampling period

But I found some strange result.
Thu Jul 4 21:25:12 2024
±----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.54.14 Driver Version: 550.54.14 CUDA Version: 12.4 |
|-----------------------------------------±-----------------------±---------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce GTX 1660 Ti Off | 00000000:01:00.0 On | N/A |
| 43% 33C P5 12W / 120W | 921MiB / 6144MiB | 2% Default |
| | | N/A |
±----------------------------------------±-----------------------±---------------------+
21:25:12 0 1148 G - - - - - - Xorg
21:25:12 0 1295 G 12 4 - - - - gnome-shell
21:25:12 0 2739 G - - - - - - chrome --type=g
21:25:12 0 4460 G - - - - - - Code --gpu-pref
21:25:12 0 23521 G - - - - - - Slack --gpu-pre

I thought that device utilization value >= sum of process utilization values
But result above : device utilization value < sum of process utilization values

Question 1) Is sampling period of device utilization and process utilization are is different?
Official NVML document says that device utilization’s sampling period is 1/6~1sec, but i couldn’t find sampling period info about process utilization. Are they the same?

Question 2) If they are the same, do the timestamps for the measurements match? In other words, do the measurements start and end at the same time?

Question 3) Is there any way to check nvmlDeviceGetUtilizationRates’s result’s timestamp? In case of nvmlDeviceGetProcessUtilization, it is able to check each sample data’s recorded timestamp.

I hope I could get clear answer from NVML developers. Thank you :)

Topic		Replies	Views
Questions on per-process GPU utilization System Management and Monitoring (NVML)	6	2345	October 30, 2023
Sampling period nvmlDeviceGetUtilizationRates CUDA Programming and Performance	1	664	August 2, 2017
Nvidia-smi and nvmlDeviceGetUtilizationRates do not match System Management and Monitoring (NVML)	0	973	May 24, 2022
How to get gpu Utilization with nvmlDeviceGetUtilizationRates() System Management and Monitoring (NVML)	2	1692	July 18, 2019
nvmlDeviceGetPowerUsage sampling rate System Management and Monitoring (NVML)	0	583	December 18, 2023
How to monitor SM utilization and SM occupancy? System Management and Monitoring (NVML)	7	9967	January 12, 2024
NVML Process Utilization & Encoder Capacity System Management and Monitoring (NVML)	0	1256	October 16, 2020
Measure SM utilization per process System Management and Monitoring (NVML)	1	1168	January 11, 2024
Is there sample period change available for nvidia-smi? CUDA Programming and Performance	5	3332	March 3, 2022
Can I profile GPU utilization with a period shorter than 1/6 sec? System Management and Monitoring (NVML)	0	452	March 12, 2020

Question: NVML utilization

Related topics