At the moment, streams are only for controlling asynchronous operations within a single GPU context. If you are using two GPUs, you currently need two host threads, because each GPU context must be owned by its own thread.
It is probably a context, thread, and GPU affinity issue; those are very hard to manage correctly with CUDA and OpenMP as things stand today. You might want to consider using something different for threading (say, Boost or a native thread library).
EDIT: Of course, there is also the possibility that both contexts are winding up on the same GPU. How are you assigning GPUs in the code?
I started over with OpenMP because that is what the project requires, and it works up to this point.
The assignment is exactly the same, based on omp_get_thread_num, and I set two OpenMP threads, as many as there are GPUs.
I really haven’t managed to find the problem.
But I can say that things are very fragile: I had to compile and run after every line of code, because even a small mistake could produce very weird results.