Multi-User GPGPU

Hello,

we have a Tesla S1070 GPU rack in our cluster system that we want to use in multi-user mode. Unfortunately, this does not work correctly. I wrote a DGEMM benchmark using CUBLAS in which the dgemm routine is called many times. When another user starts a CUDA application, in most cases its execution is blocked, but after several tries the second application starts and my benchmark is killed with an unspecified launch failure. How does the driver check the availability of the device? And why can another application kill my program even though my device memory has not been freed yet? This is a real issue for our multi-user mode…
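For context, the benchmark boils down to a loop like the sketch below. This is not the original code, just a minimal reconstruction using the CUBLAS v2 API; the matrix size `N` and iteration count are illustrative. The error check after each call is where the "unspecified launch failure" shows up once the other user's application has taken over the device.

```cuda
// Minimal sketch of a repeated-DGEMM benchmark (CUBLAS v2 API).
// N and iters are placeholder values, not from the original benchmark.
#include <cublas_v2.h>
#include <cuda_runtime.h>
#include <stdio.h>

int main(void) {
    const int N = 1024, iters = 100;
    const double alpha = 1.0, beta = 0.0;
    size_t bytes = (size_t)N * N * sizeof(double);

    double *dA, *dB, *dC;
    cudaMalloc((void**)&dA, bytes);
    cudaMalloc((void**)&dB, bytes);
    cudaMalloc((void**)&dC, bytes);

    cublasHandle_t handle;
    cublasCreate(&handle);

    for (int i = 0; i < iters; ++i) {
        cublasStatus_t st = cublasDgemm(handle, CUBLAS_OP_N, CUBLAS_OP_N,
                                        N, N, N, &alpha, dA, N, dB, N,
                                        &beta, dC, N);
        cudaError_t err = cudaDeviceSynchronize();
        if (st != CUBLAS_STATUS_SUCCESS || err != cudaSuccess) {
            // In the failure case described above, err reports
            // "unspecified launch failure" once the second app starts.
            fprintf(stderr, "iteration %d failed: %s\n",
                    i, cudaGetErrorString(err));
            break;
        }
    }

    cublasDestroy(handle);
    cudaFree(dA); cudaFree(dB); cudaFree(dC);
    return 0;
}
```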

CUDA Driver Version: 3.10
CUDA Runtime Version: 3
Compute mode: Exclusive (only one host thread at a time can use this device)
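The compute mode reported above can also be queried programmatically through the CUDA runtime API, which is handy for checking each node of the cluster; a small sketch:

```cuda
// Print the compute mode of device 0 via the CUDA runtime API.
#include <cuda_runtime.h>
#include <stdio.h>

int main(void) {
    cudaDeviceProp prop;
    cudaGetDeviceProperties(&prop, 0);
    switch (prop.computeMode) {
        case cudaComputeModeDefault:
            printf("Default (multiple host threads may use the device)\n");
            break;
        case cudaComputeModeExclusive:
            printf("Exclusive (only one host thread at a time)\n");
            break;
        case cudaComputeModeProhibited:
            printf("Prohibited (no host thread may use the device)\n");
            break;
    }
    return 0;
}
```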

Kind Regards,

Tim

It sounds a lot like this bug. Compute exclusivity seems to break for kernels that require a reasonably large number of registers. Tim Murray indicated a fix is coming for it, but I don't know precisely which driver versions are affected, or when in the release cycle the fix will make it into the drivers.

Mh… Thanks for the link to the other thread. I think I am using the latest driver. Is there a way to sign up to a notification list for the corresponding bug? Or does anybody know when this will be fixed?

The fix for this bug is coming out with CUDA 3.2/R260.

Ok. Thanks for that.
