Hi,
I’ve recently started using 2 GPUs on one system. Is it reasonable that cudaSetDevice takes 320 ms?
Also, is cudaSetDevice relevant only to the thread calling it? So for the default device (device 0) I could skip the call and save that time,
while threads working with the second GPU and up would have to call it?
You only have to call it once, so I think it is not that bad. It also creates a context (overhead you would otherwise see in the first kernel call or cudaMalloc), so there is no way around it, I believe.
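To make that one-time cost visible, here is a minimal sketch (the timing code and the cudaFree(0) warm-up idiom are illustrations of the point above, not something from the original posts) that pays for context creation up front instead of in the first kernel launch:
[codebox]
#include <cuda_runtime.h>
#include <stdio.h>
#include <sys/time.h>

int main(void)
{
    struct timeval t0, t1;
    gettimeofday(&t0, NULL);

    cudaSetDevice(1);   /* bind this host thread to the second GPU (assumes >= 2 devices) */
    cudaFree(0);        /* harmless call that forces context creation right now */

    gettimeofday(&t1, NULL);
    double ms = (t1.tv_sec - t0.tv_sec) * 1000.0
              + (t1.tv_usec - t0.tv_usec) / 1000.0;
    printf("device selection + context creation: %.0f ms\n", ms);

    return 0;
}
[/codebox]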
I have a system with more than one GPU (one GTX 280 and two Teslas). I am using one of the SDK examples and am trying to select a device that is not being used. I obtained the following code from the website:
[codebox]
#include <cuda_runtime.h>

/* Signature added for completeness; the posted snippet began mid-function.
   Selects the device with the most multiprocessors and returns its id. */
int selectBestDevice(void)
{
    int num_devices, device = 0;   /* default to device 0 (was left uninitialized) */
    cudaGetDeviceCount(&num_devices);
    if (num_devices > 1) {
        int max_multiprocessors = 0, max_device = 0;
        for (device = 0; device < num_devices; device++) {
            cudaDeviceProp properties;
            cudaGetDeviceProperties(&properties, device);
            if (max_multiprocessors < properties.multiProcessorCount) {
                max_multiprocessors = properties.multiProcessorCount;
                max_device = device;
            }
        }
        cudaSetDevice(max_device);
        device = max_device;
    }
    return device;
}
[/codebox]
But it doesn’t work! Two codes that I run both end up choosing device 0. How do I fix the problem? I have followed the instructions given on the website, like not calling the cudaInit(int argc, char **argv) function from my code. Is it because multiProcessorCount is the same for all the devices? How do I check which device is being used and which device is free?
Why do you say that? The code is obviously choosing the device with the maximum number of multiprocessors (something you could do with a simple cudaChooseDevice, by the way). From your description, it is working as designed.
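For example, a minimal sketch of the cudaChooseDevice route (the property value requested here is an illustrative assumption, not a recommendation):
[codebox]
#include <cuda_runtime.h>
#include <string.h>

int main(void)
{
    cudaDeviceProp prop;
    memset(&prop, 0, sizeof(prop));   /* zeroed fields count as "don't care" */
    prop.multiProcessorCount = 30;    /* illustrative: prefer a 30-SM device */

    int device;
    cudaChooseDevice(&device, &prop); /* picks the closest-matching device id */
    cudaSetDevice(device);

    return 0;
}
[/codebox]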
You can’t. This is a feature we have been begging for for almost two years now. NVIDIA keeps saying, “we’re thinking about it”. If you want multiple jobs to run on separate GPUs, you need an external solution, e.g. lock files, IPC, or a job queuing system such as OpenPBS/Torque or Sun Grid Engine.
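To illustrate the lock-file idea, here is a rough sketch (the /tmp path and file naming are my assumptions, and a crashed job would leave a stale lock behind that needs manual cleanup):
[codebox]
#include <cuda_runtime.h>
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

/* Claim the first GPU whose lock file we can create exclusively.
   The caller must unlink the file on exit to release the device.
   Returns the claimed device id, or -1 if every GPU appears taken. */
int claim_free_device(void)
{
    int num_devices;
    cudaGetDeviceCount(&num_devices);

    for (int dev = 0; dev < num_devices; dev++) {
        char path[64];
        snprintf(path, sizeof(path), "/tmp/cuda_gpu%d.lock", dev);

        int fd = open(path, O_CREAT | O_EXCL | O_WRONLY, 0644);
        if (fd >= 0) {               /* nobody else holds this GPU */
            close(fd);
            cudaSetDevice(dev);
            return dev;
        }
    }
    return -1;
}
[/codebox]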
This is why: I first run an SDK code, which outputs that device 0 is being used.
Then I run this modified (particles SDK) code with the device-selection function above. Note that it returns the device number chosen. It says zero as well!
I am a little confused. If that is the case, how would any code that is supposed to choose the free device work? In the code I posted above, does multiProcessorCount depend on whether that device is in use or not?
There’s no way to check whether a device is ‘free’ or not. If your application is the only user of the GPUs, you can implement some tracking at your level, something like MisterAnderson42 suggested before.
multiProcessorCount depends only on the device type, i.e. it will always be 30 for a GTX 280/Tesla C1060, no matter whether the device is busy or not.
Thanks! So that means the device-selection function is not really useful for my case. I guess I will have to manually pass a number to cudaSetDevice to choose a GPU, depending on what I have done for the other running codes.
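A minimal way to do that manual assignment (passing the device id on the command line is my assumption, not something prescribed in the thread):
[codebox]
#include <cuda_runtime.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv)
{
    /* Let the user pick the GPU explicitly, e.g. ./app 1 */
    int device = (argc > 1) ? atoi(argv[1]) : 0;

    cudaError_t err = cudaSetDevice(device);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaSetDevice(%d) failed: %s\n",
                device, cudaGetErrorString(err));
        return 1;
    }

    printf("running on device %d\n", device);
    /* ... rest of the application ... */
    return 0;
}
[/codebox]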