Unable to set exclusive compute mode using nvidia-smi

dahansen · May 11, 2009, 4:14pm

I just installed the new CUDA Toolkit 2.2 along with the latest (185.18.08) nvidia driver. We have a Tesla S1070 1U box that is part of a batch system, and for this reason would really like the exclusive compute mode. However, I can’t seem to get nvidia-smi to set it. Here is how I’m running it:

nvidia-smi -g 0 -c 1

and I’ve also tried:
nvidia-smi --gpu=0 --compute-mode-rules=1

When I run it, I then check the compute mode in my program using cudaGetDeviceProperties(), but it never comes back with the exclusive mode. Is there something I’m missing here? The new syntax for compute modes seems to be completely useless. Any insights here woudl be greatly appreciated.

Thanks,
Dan Hansen

mfatica · May 11, 2009, 11:34pm

You need to have nvidia-smi running in the background:

nvidia-smi --loop-continuously --interval=60 --filename=/var/log/nvidia-smi.log &

[root@compute-0-0 ~]# nvidia-smi -g 1 -c 1
[root@compute-0-0 ~]# nvidia-smi -g 0 -c 1
[root@compute-0-0 ~]# nvidia-smi -g 1 -s
Compute-mode rules for GPU=0x1: 0x1
[root@compute-0-0 ~]# nvidia-smi -g 0 -s
Compute-mode rules for GPU=0x0: 0x1

mutch · May 13, 2009, 12:30am

Hi,

Can you be a little clearer on what this is doing and why it’s necessary?

When you set the compute mode for a given GPU, where is that information stored? Does it “go away” after some time interval, unless the “–loop-continuously” command continually revives it? This doesn’t make sense to me.

Thanks in advance,

Jim

tmurray · May 13, 2009, 12:32am

Part of the kernel module will unload itself when no client of the driver is running. nvidia-smi is a client, so it ensures that the configuration data doesn’t get reset.

We’re going to fix this in a future driver release.

mutch · May 13, 2009, 3:57am

Thanks, that makes sense. It’s working for me now.

Mat38 · May 20, 2009, 12:13pm

For those who wants to set the exlusive compute mode on each card of the TESLA S1070, don’t forget to type these commands also for card 2 and 3

[root@compute-0-0 ~]# nvidia-smi -g 3 -c 1

[root@compute-0-0 ~]# nvidia-smi -g 2 -c 1

[root@compute-0-0 ~]# nvidia-smi -g 1 -c 1

[root@compute-0-0 ~]# nvidia-smi -g 0 -c 1

[root@compute-0-0 ~]# nvidia-smi -g 3 -s

Compute-mode rules for GPU=0x3: 0x1

[root@compute-0-0 ~]# nvidia-smi -g 2 -s

Compute-mode rules for GPU=0x2: 0x1

[root@compute-0-0 ~]# nvidia-smi -g 1 -s

Compute-mode rules for GPU=0x1: 0x1

[root@compute-0-0 ~]# nvidia-smi -g 0 -s

Compute-mode rules for GPU=0x0: 0x1

Topic		Replies	Views
nvidia-smi : how to make compute mode permanent compute mode reverts to 0 after reboot CUDA Programming and Performance	2	6407	September 21, 2010
Exclusive compute mode doesn't work with multiple GTX295's & 64-bit Linux CUDA Programming and Performance	2	2693	September 17, 2009
MPI causing trouble in memory allocation? CUDA Programming and Performance	5	11864	November 28, 2009
nvidia-smi and exclusive compute mode Legacy PGI Compilers	4	18788	April 27, 2010
compute-exclusive mode and non-Teslas it seems to work... CUDA Programming and Performance	1	12897	January 18, 2011
Tesla TCC driver with default compute mode? CUDA Programming and Performance	4	2318	September 3, 2010
Where can I find nvidia-smi.exe utility CUDA Programming and Performance	22	146198	January 14, 2023
nvidia-smi with CUDA dev environment CUDA Programming and Performance	2	8786	April 16, 2012
Compute M. Prohibited in Tesla vGRID M60-2Q NVIDIA Virtual GPU Technology	2	8071	September 8, 2017
After installing CUDA 9.0 in POWER9(RHEL7), nvidia-smi shows Unknown Error in Memory_Usage column. CUDA Setup and Installation	18	3134	June 8, 2018

Unable to set exclusive compute mode using nvidia-smi

Related topics