Multiple Independent Host Processes with One GPU Board

jm1 · May 19, 2011, 4:48pm

I have several independent host processes, each with GPU kernels to run on its own data. I am trying to understand how a CUDA GPU board is shared among the host processes. My guess is that each host process creates its own context, and thus each host process can use the entire global memory on the board.

I am also trying to understand the blocking and sharing mechanisms. Specifically, if the GPU is busy executing a kernel for one host process and a second host process tries to access the GPU board, does the second process block until the GPU is free?

I would like to hear from Nvidia engineers who can reference specific pages in the Nvidia documentation that describe this phenomenon.

Thanks

hyqneuron · May 23, 2011, 11:22am

A quick look at the programming guide (streams, contexts, concurrent execution) should answer all your questions. I do not have the guide with me (on an iPod right now), so I'll just put down what I remember:
Each CPU process gets its own context. Kernels from different contexts cannot execute in parallel for now. I think Nvidia is working to enable this.
A context itself occupies some memory. Different contexts have different virtual address spaces, which means each context only has a limited amount of global memory that it can use. However, the device does not seem to enforce strict access control: you can even black out your screen by misusing pointers. So perhaps, in one way or another, each context does have access to the entirety of the global memory on your device.

Anyway, you can check the guide yourself to confirm this. I remember it being somewhere in sec 3.2.x.x
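One way to poke at this yourself is to run a small probe from each host process. The sketch below is only an illustration, not something taken from the guide: it assumes device 0 and a CUDA toolkit that provides `cudaDeviceGetAttribute`, and the helper name `compute_mode_name` is made up for this example. It prints the device's compute mode (which controls whether more than one host process may create a context on the board) and the free and total global memory as seen from this process's context.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: map a compute-mode value to a short label.
const char* compute_mode_name(int mode) {
    switch (mode) {
        case cudaComputeModeDefault:          return "default";
        case cudaComputeModeProhibited:       return "prohibited";
        case cudaComputeModeExclusiveProcess: return "exclusive-process";
        default:                              return "other";
    }
}

int main() {
    const int device = 0;  // assumption: probe the first GPU in the system

    cudaError_t err = cudaSetDevice(device);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaSetDevice: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // Compute mode: "default" allows several host processes to use the device,
    // "exclusive-process" allows one, "prohibited" allows none.
    int mode = 0;
    err = cudaDeviceGetAttribute(&mode, cudaDevAttrComputeMode, device);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaDeviceGetAttribute: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // Free and total global memory as reported to this process.
    // This call also creates the process's context if none exists yet.
    size_t free_bytes = 0, total_bytes = 0;
    err = cudaMemGetInfo(&free_bytes, &total_bytes);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaMemGetInfo: %s\n", cudaGetErrorString(err));
        return 1;
    }

    printf("compute mode: %s\n", compute_mode_name(mode));
    printf("global memory: %zu MiB free of %zu MiB\n",
           free_bytes >> 20, total_bytes >> 20);
    return 0;
}
```

Running it in two processes at once, with one of them holding a large `cudaMalloc` allocation, should show the free figure dropping in the other process, since all contexts allocate from the same physical memory on the board.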
