Hi Nvidia Gurus!
I have two GTX 295 boards on RedHawk 5.3 Real Time 64-bit Linux (kernel 2.6.26.8). I am using the following script to set up exclusive compute mode. nvidia-smi runs continuously in the background as a client to keep the exclusive compute mode settings in force, so that each host application will (hopefully) end up on a different GPU:
# Nvidia forum states nvidia-smi must be running continuously in the background for a GPU mode to stay “set”
nvidia-smi -l -i 30 -lsa &
# Now actually set the modes to exclusive use by one host thread per GPU…
sudo nvidia-smi -g 0 -c 1
sudo nvidia-smi -g 1 -c 1
sudo nvidia-smi -g 2 -c 1
sudo nvidia-smi -g 3 -c 1
# Now list the compute modes we just set…
nvidia-smi -g 0 -s
nvidia-smi -g 1 -s
nvidia-smi -g 2 -s
nvidia-smi -g 3 -s
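As a cross-check, the compute mode can also be queried from the runtime API rather than from nvidia-smi; a minimal sketch (assuming CUDA 2.2 or later, where cudaDeviceProp exposes a computeMode field) would look like this:

/* Sketch: print the compute mode of each device as the CUDA runtime sees it.
 * Assumes CUDA 2.2+ (computeMode field in cudaDeviceProp). */
#include <stdio.h>
#include <cuda_runtime.h>

int main(void)
{
    int count = 0;
    cudaGetDeviceCount(&count);

    for (int dev = 0; dev < count; ++dev) {
        cudaDeviceProp prop;
        cudaGetDeviceProperties(&prop, dev);
        const char *mode =
            (prop.computeMode == cudaComputeModeExclusive)  ? "exclusive" :
            (prop.computeMode == cudaComputeModeProhibited) ? "prohibited" :
                                                              "default";
        printf("GPU %d (%s): compute mode = %s\n", dev, prop.name, mode);
    }
    return 0;
}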
According to nvidia-smi, the GTX 295 cards report that they are set to exclusive compute mode. I then start two host applications. The first enumerates a total of 4 GPUs and selects GPU 0. The second starts, enumerates all 4 GPUs, then uses cudaSetDevice to select GPU 2. Both host applications then seem to be using GPU 0: I see its temperature spike much higher than the other 3 GPUs, which appear to be idling.

What am I doing wrong? Do I need to use the CUDA driver API instead of the CUDA runtime API (C for CUDA)?
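For reference, the device selection in the second application is along these lines (a simplified sketch rather than the exact code; cudaSetDevice is called before any other runtime call, and the error checking here is just for illustration):

/* Sketch: select GPU 2 before any other runtime call, then confirm which
 * device the runtime actually bound the context to. */
#include <stdio.h>
#include <cuda_runtime.h>

int main(void)
{
    int requested = 2, actual = -1;

    cudaError_t err = cudaSetDevice(requested);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaSetDevice(%d) failed: %s\n",
                requested, cudaGetErrorString(err));
        return 1;
    }

    cudaFree(0);              /* force context creation on the chosen device */
    cudaGetDevice(&actual);
    printf("requested GPU %d, runtime reports GPU %d\n", requested, actual);

    /* ... allocations, kernel launches, etc. ... */
    return 0;
}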
Thanks,
Evan Wheeler