Does calling a CUDA function disable OpenMP? Can they co-exist in the same application?

I have an image processing function that’s implemented for both CUDA and OpenMP. Both implementations run fine when run separately.

Then I wrote a benchmark to compare the processing times of the two implementations, and I found a problem: once the CUDA implementation has executed, the OpenMP implementation is no longer parallelized. Instead of being split across 4 threads, the loop runs on a single thread. The processing time goes up, and I can see CPU usage drop to 25% instead of 100% (I have a 4-core machine).

What could cause this? I thought the two APIs were independent. I removed portions of the CUDA code one by one and found that OpenMP becomes disabled as soon as I call cudaMallocPitch to allocate an image buffer on the device.

If anyone has any kind of insight on what is going on please let me know!

I’m using a GT 240 with driver 197.13 and CUDA 3.0 on Windows XP with Visual Studio 2005. The CUDA implementation runs inside a DLL that creates one thread per GPU found in the system, in order to serialize all requests for that GPU.

That looks weird… I’m curious to see if there’s a compatibility issue out there…

Are you using the profiler? It sets CPU affinity for timing purposes. As does cutil, probably.

(insert your very own “seriously guys don’t use cutil for anything, you don’t know what it actually does” plea here)