NVIDIA-SMI: Great starting point for monitoring GPU...

lwignall · March 28, 2014, 5:03pm

With NVIDIA drivers installs there is the nvidia-smi tool. See the following link for info:
https://developer.nvidia.com/nvidia-system-management-interface

See this link for the man page and various switches/tools to use:
http://developer.download.nvidia.com/compute/cuda/5_5/rel/nvml/nvidia-smi.5.319.43.pdf

For GRID specifically, keep in mind the location of the driver. As an example, if using vSGA (shared) in VMware then the driver is the VIB loaded on the ESXi hypervisor, so the nvidia-smi executable is located there, on the hypervisor. If on that same deployment you chose to use on GPU as pass-through then that GPU is looking to the NVIDIA driver on the guest and so nvidia-smi is located on the guest.

rachel2 · April 3, 2014, 9:41am

For vGPU do make sure you run "nvidia-smi -l" without the loop flag you can see stalls as it reenters the API - I added some other tools here: http://www.xenserver.org/partners/developing-products-for-xenserver/20-dev-hints/133-xs-dev-tools.html

and GPU specific ones: http://www.xenserver.org/partners/developing-products-for-xenserver/18-sdk-development/136-xs-dev-gpu-tools.html

For pass-through you will need to use something in guest e.g. Fraps

andy30 · April 4, 2014, 5:28pm

There was a bug in the tech preview / beta release of vGPU where running nvidia-smi would cause a visible glitch or stall in VMs running vGPU - but this was fixed in the vGPU 1.0 production release. You shouldn’t see any stall when running nvidia-smi now.

Topic		Replies	Views
Capture vGPU performance data in csv format using nvidia-smi Monitoring/Assessment Tools	0	29550	May 19, 2014
Tesla GPU info monitoring tool CUDA Programming and Performance	2	10827	July 2, 2012
How Can You Monitor Memory Usage on a XenServer Running XenApp in GPU Passthrough Mode? XenApp	5	13165	May 25, 2014
Nvidia VMware vSphere-6.7 NVIDIA Virtual GPU Technology	14	10156	August 19, 2019
What software to use for our new single NVIDIA T4 Tesla card on VMware 6.7 ESXi Host General Discussion	14	10009	August 17, 2020
vGPU Utilization Per VM NVIDIA Virtual GPU Technology	22	39457	August 25, 2016
GPU in a VM pass-through setting NVIDIA Virtual GPU Drivers	19	70449	April 29, 2021
Monitor GPU usage with nvidia-smi Linux	6	5683	October 14, 2021
Nvidia-smi "No device where found" Linux	36	5979	December 30, 2021
NVIDIA-SMI couldn't communicate with the NVIDIA driver General Discussion	6	1680	March 8, 2022

NVIDIA-SMI: Great starting point for monitoring GPU...

Related topics