Sharing GPUs on multi-gpu, multiuser systems When cudaSetDevice() goes bad.

MJH22 · February 27, 2009, 5:02pm

PBS Pro (and probably SGE, LSF, Torque and all the other batch systems) knows squat about allocating GPUs, nor does the CUDA runtime seem to allow exclusive ownership of a device.

Here’s an LD_PRELOAD-able shim that overrides cudaSetDevice() and ensures that GPU-requesting programs will get exclusive use of a GPU, or die trying.
It arbitrates the GPU allocation with lockfiles in /var/lock/cuda (or the location of CUDA_LOCKFILE_DIR).

Use it by setting LD_PRELOAD=/path/to/cuPlayNicely.so. It will tell what it’s up to if you set CUDA_LOCKFILE_VERBOSE.

Enjoy.

Matt
cuPlayNicely.tar (10 KB)

kristleifur · February 27, 2009, 7:33pm

Cool! Thanks for sharing.

Mu-Chi_Sung · February 28, 2009, 12:56am

Wow, never thought about this approach…nice work!

But I just wondering whether cudaSetDevice() will be called or not if the user didn’t write the line manually…?

MJH22 · February 28, 2009, 3:46pm

No. If it’s not called explicitly in the user code, device #0 will used. Not obvious way to over-ride that that I can think of.

You could always move the call to the real cudaSetDevice() to the DSO’s constructor, but then any program that preloads the library will acquire a GPU, required or not.

Also, for completeness, you’d probably want to override the driver API’s set device function, in case you should have a user into that sort of self-abuse.

M

tmurray · March 1, 2009, 8:26pm

Device management will be less of an issue soon.

Guillermo_Andrade · March 17, 2009, 11:50am

Hello
I thank you for this useful tool.

But, There is a little question :
how to automatic call the function “my_fini” to release the device even when program is interrupted ?
Is there a command to call to “unload” library after one kills the program ?

Thanks,

Guillermo

Topic		Replies	Views
how to select the device manually CUDA Programming and Performance	2	4206	November 4, 2009
How cn a process get exclusive GPU access (EXCLUSIVE_PROCESS) CUDA Programming and Performance	1	1849	November 23, 2011
cudaSetDevice switch to different thread? CUDA Programming and Performance	2	3207	April 16, 2008
cudaSetDevice question CUDA Programming and Performance	12	33268	February 3, 2009
In CUDA rule, cudaSetDevice() is necessary for cudaFree() or not? CUDA Programming and Performance	0	1099	March 29, 2019
cudaSetDevice : overrides the auto select free gpu feature in cuda CUDA Programming and Performance	2	1079	February 6, 2015
Two general Multi-GPU questions. CUDA Programming and Performance	4	2791	January 24, 2012
How many times does cudaSetDevice need to be called? CUDA Programming and Performance	4	2543	July 6, 2009
CUDA 4.0 multi-gpu auto-selection how to do it? CUDA Programming and Performance	1	875	August 18, 2011
How to get fastest free GPU cutGetMaxGflopsDeviceId always same device CUDA Programming and Performance	3	3173	June 22, 2012

Sharing GPUs on multi-gpu, multiuser systems When cudaSetDevice() goes bad.

Related topics