OpenMP & CUDA

Drinker · September 22, 2008, 5:33pm

I have the following code:

OMP_SET_NUM_THREADS(2);

#pragma omp parallel

{

     unsigned int cpu_id = OMP_GET_THREAD_NUM();

     cudaSetDevice( cpu_id );

     if(cpu_id==0)

             execute_kernel1<<..>>(...);

     else

             execute_kernel2<<..>>(...);

}

Now I have the following question, will these two kernels launch concurrently on two separate cards?

E.D_Riedijk · September 22, 2008, 5:45pm

not without explicit coding.

Drinker · September 22, 2008, 5:50pm

So I have to do it like in the multiGPU example of the cuda SDK, OpenMP is not enough?

E.D_Riedijk · September 22, 2008, 6:08pm

oh wait, did not notice that.
Well, if you have declared all the variables on the right GPU too and moved the required memory on it, then probably that should work as far as I can tell, but I have never played with openmp, so don’t take my word for it.

tmurray · September 22, 2008, 6:14pm

it should work. if I saw OpenMP code like this in a program I’d probably kill you (hard-coding all of that? such a bad idea), but that’s neither here nor there…

Drinker · September 22, 2008, 6:22pm

Yeah I have everything set up for both cards.
I have also forgot to mention that bot kernels are running inside a loop.
I am asking this question because when I measure the execution time of this code, and the same code with one kernel call commented out. The one kernel call version runs almost exactly half the time of the two kernel version. It seems to me like the kernel calls are being serialized. So I would like an opinion from an expert.

EDIT:
It just a simple example not an actual application…

paulius · September 22, 2008, 8:17pm

Yes, these will go to two different GPUs.

I think there is a cudaOpenMP sample in the SDK version for Windows.

Paulius

Topic		Replies	Views
CUDA + OpenMP CUDA Programming and Performance	2	788	December 8, 2016
OpenMP and CUDA Legacy PGI Compilers (archived)	1	752	July 10, 2020
OpenMP Multi-GPU, not getting speedup expected CUDA Programming and Performance	5	5969	July 15, 2011
CUDA & openMP Problem with the SDK sample code CUDA Programming and Performance	11	14189	September 12, 2015
Launching Kernels in simultaneously on two GPUs CUDA Programming and Performance	6	20963	May 26, 2011
OpenMP with Cuda Documentation CUDA Programming and Performance	2	1224	August 9, 2013
MutiGPU (OpenMP) - Tasks runs in series several times. Using OpenMP to run in 2 GPUs. Parallel secti CUDA Programming and Performance	0	925	December 26, 2011
Multi-GPU with OpenMP CUDA Programming and Performance	3	2505	October 31, 2018
OpenMP + CUDA Multiple Parallel Sections Does GPU to Thread linking persist across multiple parallel CUDA Programming and Performance	11	3808	June 29, 2011
CUDA + OpenMP oddity - looks like a compiler bug. Legacy PGI Compilers (archived)	6	12292	April 12, 2010

OpenMP & CUDA

Related topics