Is there sample period change available for nvidia-smi?

sakaia · February 17, 2022, 12:05am

I am measuring an application using nvidia-smi.
It seems higher value for utilization.gpu compared to nsys (nsight systems) time chart.
From reading the nvidia-smi man, the sample period for nvidia-smi is 1 or 1/6seconds. It is good for long time CUDA kernel, but not good for short time CUDA kernel.
Can nvidia-smi change the sample period (especially for short time multi-CUDA kernel) ?

Reference
https://developer.download.nvidia.com/compute/DCGM/docs/nvidia-smi-367.38.pdf

njuffa · February 17, 2022, 12:27am

To my knowledge, nvidia-smi does not have a user-selectable setting to choose sampling frequencies from a fairly wide range. Such functionality is not needed for a tool designed for general system monitoring and management.

In my experience, specifying a high sampling frequency actually has a negative impact on the kernels execution on the GPU. On Windows, I use a third-party tool called TechPowerUp GPU-Z to monitor my GPUs. Its lowest sampling interval setting is 0.1 seconds. I tried that and judged it unusable due to the reason I stated and went back to sampling at 1 Hz. nvidia-smi (and presumably GPU-Z) makes use of the NVML (NVIDIA management library), which you can use in your own applications.

What metrics in particular are you trying to capture with fine granularity, and what kind of sampling frequency do you hope to achieve? Have you checked the literature what set-ups others use for that purpose?

Robert_Crovella · February 17, 2022, 12:53am

That’s a fairly old driver and therefore fairly old documentation.

nvidia-smi has command line help, have you checked it?

nvidia-smi --help

for example, I have driver 470.57.02 and I see in the help output:

-l,   --loop=               Probe until Ctrl+C at specified second interval.
-lms, --loop-ms=            Probe until Ctrl+C at specified millisecond interval.

If you study the command line help a bit more, you might try something like this:

nvidia-smi -lms 100 -q -d UTILIZATION

njuffa · February 17, 2022, 1:24am

NVIDIA’s main web page for nvidia-smi

links this documentation:

This does mention the -lms option, but I overlooked it:

-lms ms, --loop-ms=ms
Same as -l,–loop but in milliseconds.

Sorry about that, did not mean to mislead.

Robert_Crovella · February 17, 2022, 1:41am

I recommend that people use the command line help for detailed nvidia-smi usage.

system · March 3, 2022, 1:42am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Question about GPU Utilization in nvidia-smi output CUDA Programming and Performance ubuntu	0	383	November 27, 2023
Sampling period nvmlDeviceGetUtilizationRates CUDA Programming and Performance	1	737	August 2, 2017
Power measurement with nvidia-smi CUDA Programming and Performance	1	5926	October 12, 2017
BUG REPORT: nvidia-smi shows 0% GPU-Util when sampling elapsed_cycles_sm event CUDA Programming and Performance	1	1373	January 3, 2019
Power measurement with nvidia-smi CUDA Programming and Performance	1	1544	October 13, 2017
NVML overhead CUDA Programming and Performance	6	2108	March 24, 2020
SM frequency reported in Nsight Compute Nsight Compute	4	1001	September 1, 2023
nvidia-smi LINUX correct? CUDA Programming and Performance	1	674	May 4, 2019
Nvidia-smi -lms 1 and runtime CUDA Programming and Performance	10	2574	September 22, 2022
Why does GR3D freq always change a lot when I use sudo ~tegrastats --interval 5000 to monitor GPU Jetson AGX Xavier	10	1009	October 18, 2021

Is there sample period change available for nvidia-smi?

Related topics