Scheduling of kernels from different processes. Questions on how CUDA devices are shared amongst competing processes

I have many questions on how a CUDA device is shared amongst competing processes. I haven’t been able to find much information on how this is actually done.

Does anyone have any information (or educated guesses) on how CUDA executes/schedules kernels from different processes (or contexts)? Basically, how is the CUDA device shared amongst the different processes? I understand that a CUDA device can be placed in compute-exclusive mode, but I’m more interested in how it is shared.
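For reference, this is the mode setting I mean. A minimal sketch of checking it from the runtime API (the enum names are from cuda_runtime.h; I’m assuming device 0 and a toolkit recent enough to report the mode in cudaDeviceProp):

```cpp
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    cudaDeviceProp prop;
    cudaError_t err = cudaGetDeviceProperties(&prop, 0);  // query device 0
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaGetDeviceProperties failed: %s\n",
                cudaGetErrorString(err));
        return 1;
    }
    // computeMode tells us how the device may be shared:
    //   cudaComputeModeDefault    - multiple host threads/processes may use it
    //   cudaComputeModeExclusive  - only one context at a time
    //   cudaComputeModeProhibited - no contexts may be created
    switch (prop.computeMode) {
        case cudaComputeModeDefault:    printf("Default (shared)\n"); break;
        case cudaComputeModeExclusive:  printf("Exclusive\n");        break;
        case cudaComputeModeProhibited: printf("Prohibited\n");       break;
        default:                        printf("Unknown mode\n");     break;
    }
    return 0;
}
```

My question is about what happens in the Default (shared) case.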

Are kernels queued up on the driver/host side or on the device? Is there any ordering to the queue (regardless of host vs. device side), or is it simply FIFO? If there is an ordering, is it based on job-size parameters and/or on how long a kernel has been waiting to execute? And what about fairness, where a process submitting many kernels could overwhelm the others?
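To make the queuing question concrete, this is the kind of experiment I have in mind. The launches below all return to the host immediately, so the kernels must be sitting in a queue somewhere between launch and completion (busy_wait is a hypothetical spin kernel I’d write just for the test; clock64() needs compute capability 2.0+):

```cpp
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel that spins for roughly 'cycles' clock ticks,
// just to keep the device busy for a measurable amount of time.
__global__ void busy_wait(long long cycles) {
    long long start = clock64();
    while (clock64() - start < cycles) { /* spin */ }
}

int main() {
    // Kernel launches are asynchronous: all ten calls return to the
    // host immediately, so the pending kernels must be queued somewhere,
    // on the driver/host side, on the device, or split between both.
    for (int i = 0; i < 10; ++i)
        busy_wait<<<1, 1>>>(100000000LL);

    cudaError_t err = cudaGetLastError();
    printf("after launches: %s\n", cudaGetErrorString(err));

    // Only here does the host block until the queue drains.
    cudaDeviceSynchronize();
    printf("all kernels finished\n");
    return 0;
}
```

If two processes each run this at the same time, how do their queues interleave on the device?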

Finally, with Fermi’s concurrent kernel execution (of kernels from the same context), any speculation on what might prevent starvation of other tasks? I’m thinking of a case where a context could keep its “foot in the door” by continually feeding in more kernels, as in the sketch below.
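For concreteness, here is the pattern I’m imagining: a single context keeps several streams topped up with short kernels, which are eligible to run concurrently on Fermi. This is just an illustrative sketch (tiny_kernel and the stream/round counts are made up for the example):

```cpp
#include <cuda_runtime.h>

// Stand-in for whatever short kernel the "greedy" context keeps feeding in.
__global__ void tiny_kernel(float* data) {
    data[threadIdx.x] += 1.0f;
}

int main() {
    const int kStreams = 4;
    cudaStream_t streams[kStreams];
    float* buf;
    cudaMalloc(&buf, kStreams * 32 * sizeof(float));
    for (int i = 0; i < kStreams; ++i)
        cudaStreamCreate(&streams[i]);

    // Keep every stream topped up with work. If the device always has a
    // kernel from this context ready to go, when (if ever) does a kernel
    // from another process get scheduled?
    for (int round = 0; round < 1000; ++round)
        for (int i = 0; i < kStreams; ++i)
            tiny_kernel<<<1, 32, 0, streams[i]>>>(buf + i * 32);

    cudaDeviceSynchronize();
    for (int i = 0; i < kStreams; ++i)
        cudaStreamDestroy(streams[i]);
    cudaFree(buf);
    return 0;
}
```

Does anything in the driver or hardware force a gap where another process’s pending kernels can get in?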