Help using single GPU among multithreaded CPU

I have a CPU with 8 threads. I want to split the resources of my GPU evenly among the threads. My GPU is capable of launching up to 65536 blocks, and I would like to set aside 8000 blocks for each CPU thread.

What is the best way to go about this? My algorithm knows which CPU thread it is on. I have two ideas for how to do this but don't know which, if either, would work.

  1. Is it possible I could specify which blocks to use on my GPU? i.e. 0-7999 for CPUThread 1, 8000-15999 for CPUThread 2…

  2. Is it possible for the different CPU threads to each call the GPU kernel method individually (but in parallel)? Example below:

__global__ void kernel( float* a, float *b, float *c, int *CPUThreadIndex )
{
    int idx = *CPUThreadIndex;  // one int per CPU thread, copied to the device beforehand
    if (idx * 8000 <= blockIdx.x && blockIdx.x < (idx + 1) * 8000)
    {
        // Do stuff on the kernel
    }
}

int main(void)
{
    // dev_a, dev_b, dev_c, and dev_CPUThreadIndex allocated with cudaMalloc
    // and filled with cudaMemcpy beforehand
    kernel<<<64000, 1>>>( dev_a, dev_b, dev_c, dev_CPUThreadIndex );
    cudaDeviceSynchronize();
    return 0;
}

If each separate CPU thread reaches the main function at around the same time, would something like the method above work in parallel?
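As a possible simplification (a hedged sketch, not from the original post): rather than launching all 64000 blocks from every CPU thread and masking out the ones that don't belong, each CPU thread could launch only its own 8000-block slice and pass a block offset. The kernel name, the placeholder work, and the `launchForCpuThread` helper are all illustrative assumptions.

```cuda
#include <cuda_runtime.h>

// Each CPU thread launches only its own 8000 blocks; blockOffset tells the
// kernel which part of the global 0..63999 block range this slice covers.
__global__ void sliceKernel(float *a, float *b, float *c, int blockOffset)
{
    int globalBlock = blockIdx.x + blockOffset;
    c[globalBlock] = a[globalBlock] + b[globalBlock];  // placeholder work
}

// Hypothetical per-thread launch helper.
void launchForCpuThread(int cpuThreadIndex,
                        float *dev_a, float *dev_b, float *dev_c)
{
    const int blocksPerCpuThread = 8000;
    sliceKernel<<<blocksPerCpuThread, 1>>>(
        dev_a, dev_b, dev_c, cpuThreadIndex * blocksPerCpuThread);
}
```

This avoids launching 56000 blocks per call that immediately fail the index check and do nothing.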

Thanks for any help you guys can provide.

Do threads belong to the same CUDA Context?

I have worked with multiple CPU threads all using the same GPU, and I have found the following:

  1. with CUDA 5 and Kepler, multiple streams belonging to the same CUDA context can share GPU resources "at the same time".

  2. if you want to share the GPU between different threads, you have to create one CUDA context for each CPU thread, but only one context at a time can use the GPU.
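Point 1 above can be sketched roughly as follows (an assumed setup, not code from the thread): each CPU thread issues its slice of work on its own non-default stream within the one shared runtime-API context. The kernel and the `workerBody` helper are illustrative.

```cuda
#include <cuda_runtime.h>

__global__ void sliceKernel(float *c, int blockOffset)
{
    c[blockIdx.x + blockOffset] = (float)blockIdx.x;  // placeholder work
}

// Body executed by each of the 8 CPU threads. Kernels launched into
// different non-default streams of the same context may run concurrently
// on Kepler-class hardware if resources allow.
void workerBody(int cpuThreadIndex, float *dev_c)
{
    cudaStream_t stream;
    cudaStreamCreate(&stream);

    // Launch only this CPU thread's 8000 blocks, offset into the data.
    sliceKernel<<<8000, 1, 0, stream>>>(dev_c, cpuThreadIndex * 8000);

    cudaStreamSynchronize(stream);  // wait only for this thread's work
    cudaStreamDestroy(stream);
}
```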

I'm sure I haven't been very clear, so maybe someone more expert can help us.

In CUDA C programming guide p.61:

Default compute mode: Multiple host threads can use the device (by calling cudaSetDevice() on this device, when using the runtime API, or by making current a context associated to the device, when using the driver API) at the same time.

I think m_colaprico is right. You can call cudaSetDevice() in each CPU thread to share the same GPU. But the kernels may not execute in parallel.
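A minimal sketch of that pattern, assuming POSIX threads and a trivial placeholder kernel (the function names and the per-thread allocation are illustrative, not from the thread): with the runtime API, every host thread that calls cudaSetDevice() on the same device shares that device's primary context automatically.

```cuda
#include <cuda_runtime.h>
#include <pthread.h>

__global__ void sliceKernel(float *c, int blockOffset)
{
    c[blockIdx.x + blockOffset] = 1.0f;  // placeholder work
}

// Hypothetical body for each of the 8 pthreads; arg points at the
// thread's index (0..7).
void *threadFunc(void *arg)
{
    int cpuThreadIndex = *(int *)arg;

    // Each host thread selects the same device; all of them then share
    // the device's primary context.
    cudaSetDevice(0);

    float *dev_c;
    cudaMalloc(&dev_c, 64000 * sizeof(float));

    // Launch only this thread's 8000-block slice.
    sliceKernel<<<8000, 1>>>(dev_c, cpuThreadIndex * 8000);
    cudaDeviceSynchronize();

    cudaFree(dev_c);
    return NULL;
}
```

Whether those launches overlap on the device is a separate question from whether the threads can all submit work.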

Thanks tzuhung!
Moreover, I have learned that you can execute multiple CPU threads in parallel on the same GPU only on a Tesla Kepler, using Hyper-Q with MPI tasks.

I will test whether it is also possible with OpenMP CPU threads.
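The OpenMP experiment could look roughly like this (a hedged sketch only; the kernel and function names are assumptions, and whether the kernels actually overlap depends on the hardware, e.g. Hyper-Q on Kepler Tesla parts):

```cuda
#include <cuda_runtime.h>
#include <omp.h>

__global__ void sliceKernel(float *c, int blockOffset)
{
    c[blockIdx.x + blockOffset] = 1.0f;  // placeholder work
}

// 8 OpenMP threads, each launching its 8000-block slice on a private
// stream within the shared runtime-API context.
void runWithOpenMP(float *dev_c)
{
    #pragma omp parallel num_threads(8)
    {
        int tid = omp_get_thread_num();

        cudaStream_t s;
        cudaStreamCreate(&s);
        sliceKernel<<<8000, 1, 0, s>>>(dev_c, tid * 8000);
        cudaStreamSynchronize(s);
        cudaStreamDestroy(s);
    }
}
```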

Does this really work?