Making Function Calls within Accelerator Code Blocks

Pebbles1 · September 28, 2010, 2:06pm

I have read that function calls cannot be made within the pragmas of code blocks to be accelerated. Is there a way around this besides in-lining all of the function calls? Or is this something that will be available with future releases? I know that OpenMP allows function calls within the code blocks to be accelerated and would like to do the same with PGI Accelerator.

MatColgrove · September 28, 2010, 3:56pm

Is there a way around this besides in-lining all of the function calls?

No, though the compiler is able to perform automatic inlining (see -Minline/-Mextract, -Mautoinline, and -Mipa=inline). It doesn’t work in all cases, but it is worth a try before hand-inlining the routines.

I should note that this is not a PGI limitation, but rather a general limitation with NVIDIA. CUDA C and CUDA Fortran appear to allow calls, but in reality all calls get inlined.
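As a rough illustration of the two options above, here is a minimal sketch in C. The helper scale_add and both wrapper functions are hypothetical names made up for this example, and the directive shown is the PGI Accelerator "#pragma acc region" form. The first version relies on the compiler inlining the call (for example by adding -Minline to the compile line); the second is what hand-inlining the same routine would look like. Whether automatic inlining succeeds depends on the routine, so treat this as a sketch under those assumptions rather than a guaranteed recipe.

```c
/* Hypothetical helper: small and side-effect free, so it is a
 * reasonable candidate for automatic inlining (e.g. -Minline). */
static float scale_add(float x, float y)
{
    return 2.0f * x + y;
}

/* Version 1: the loop body contains a call. The region can only be
 * offloaded if the compiler manages to inline scale_add. */
void saxpy_call(int n, const float *restrict x,
                const float *restrict y, float *restrict out)
{
#pragma acc region
    {
        for (int i = 0; i < n; ++i)
            out[i] = scale_add(x[i], y[i]);
    }
}

/* Version 2: the same computation with the routine inlined by hand,
 * so no call remains inside the accelerator region. */
void saxpy_inlined(int n, const float *restrict x,
                   const float *restrict y, float *restrict out)
{
#pragma acc region
    {
        for (int i = 0; i < n; ++i)
            out[i] = 2.0f * x[i] + y[i];
    }
}
```

A compiler without accelerator support simply ignores the pragma and runs both loops on the host, so the two versions can be checked against each other before worrying about the GPU. With PGI, adding -Minfo=accel to the compile line should report whether a kernel was actually generated for each region.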

Or is this something that will be available with future releases?

Possibly, but there are a number of technical challenges that need to be overcome first. The first is the lack of a linker for device code. Without a linker, there isn’t a way to associate symbols. The second is the lack of context switching and a software stack at runtime, though NVIDIA has added better support for this. Third, we need a way to ensure that the function being called has a device version. There are most likely more, but these are the ones that come to mind.

We definitely want to allow function calls within acc compute regions. It is one of the major limitations of the model and one of the most requested features.

Thanks for your interest,
Mat
