nvidia-smi with CUDA dev environment

heipei · April 16, 2012, 9:07am

Hi everyone,

I need to use nvidia-smi to frequently poll my GPUs and keep them active, or else this will happen: The Official NVIDIA Forums | NVIDIA. Also, I want to be able to set compute exclusive mode.

I’m on Ubuntu (11.10) and have installed the CUDA Toolkit (4.1) and the NVIDIA 285.05.33 driver from the website. The nvidia-smi tool is not part of it however, and if I install nvidia-current from the Ubuntu repository it messes up my driver installation.

So, the question is this: How can I install nvidia-smi (or something similar) without running into driver-trouble. Alternatively, how can I poll the GPUs to keep them active (see link).

Przemyslaw_Zych · April 16, 2012, 10:08am

Hi heipei,

nvidia-smi and NVML should be both installed as a part of NVIDIA Driver Installation process to /usr/bin/nvidia-smi and /usr/lib64/libnvidia-ml.so*.

Are you installing the driver in some non-standard way?

A quick fix for your case could be:

Write a very short application that runs “cuInit” and freezes. This will keep the device nodes busy.

Or write an application that opens the nodes yourself:

Something like:

fopen("/dev/nvidiactl", "r")

 fopen("/dev/nvidia0", "r")

 fopen("/dev/nvidia1", "r")

...

The best way though would be to download the Driver again. e.g. http://developer.download.nvidia.com/compute/cuda/4_1/rel/drivers/NVIDIA-Linux-x86_64-285.05.33.run

And make a fresh default installation with “./NVIDIA-Linux-x86_64-285.05.33.run -s”.

When both NVML and (the command line wrapper around NVML) nvidia-smi are both present, the best solution would be to enable “persistence mode”

nvidia-smi --persistence-mode=1

Which will prevent the driver from unloading.

Let me know if you have other questions.

Regrads,

Przemyslaw Zych

heipei · April 16, 2012, 11:33am

Thanks a lot, that solved it. I had installed the driver using sh <filename.sh>, without the -s flag. Now I have nvidia-smi and was able to set persistence mode. Thank you for the quick response!

Topic		Replies	Views
In what step is nvidia-smi supposed to be installed? CUDA Programming and Performance	13	118332	December 16, 2022
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running TensorRT cuda , cudnn , nvidia-smi	3	803	July 4, 2024
nvidia-smi hangs. cannot be killed even by SIGKILL CUDA Setup and Installation	1	10285	April 5, 2016
nvidia-smi : how to make compute mode permanent compute mode reverts to 0 after reboot CUDA Programming and Performance	2	6309	September 21, 2010
NVIDIA-SMI no longer works and fresh nvidia-driver installs fail CUDA Setup and Installation cuda , ubuntu	1	1734	January 16, 2024
Delete all nvidia-driver but still show in nvidia-smi on ubuntu 18.04 CUDA Setup and Installation	3	2397	February 12, 2024
Cannot query NVIDIA drivers on Ubuntu - new issue cuDNN	3	1074	March 11, 2020
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. CUDA Setup and Installation	0	538	February 24, 2020
Cuda10 installing problem, nvidia-smi is not working CUDA Setup and Installation	1	4761	December 27, 2019
Newly installed drivers are not found when nvidia-smi is called. Linux	16	33445	February 10, 2025

nvidia-smi with CUDA dev environment

Related topics