Calling template kernel from template function

michael.hansen · February 14, 2008, 3:39pm

Hi,

I’ve been using CUDA on Linux for a while and was happy to see that I should be able to run it on my Macbook Pro now. I’ve installed everything fine and all the examples in the SDK run fine (at least the ones I have tested).

However, I have some problems with compiling some code which I have successfully compiled on Linux and Windows. The situation is as follows:

In a .cu file (let’s call it example.cu) I have a template function and a template kernel which is called from the template function:

template <class T> __global__ void my_kernel(cuFloatComplex* data_in, cuFloatComplex* data_out, T dim)
{
…
}

The function which calls it looks something like this:

template <class T> void
my_function(cuFloatComplex* data_in, cuFloatComplex* data_out, T dim)
{
…

my_kernel<<< gridDim, blockDim >>>(data_in, data_out, dim);

…
}

I can compile this on Linux and Windows and it runs. When trying to compile on the Mac, I get an error message:

example.cu: In function ‘void my_function(cuFloatComplex*, cuFloatComplex*, T)’:
example.cu:82: error: ‘my_kernel’ was not declared in this scope
make: *** [obj/release/FFT.cu_o] Error 255

If I explicitly declare overloads of my_function for all the types I need, then it compiles fine, but that is a bit messy, and it seems strange that it should be necessary.
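
For concreteness, here is a minimal, self-contained sketch of the pattern described above. The kernel body (a plain element-wise copy), the launch configuration, the explicit template argument at the launch site, the explicit instantiation for int, and the helper round_trip_matches are all assumptions made for illustration and are not taken from the original code. Explicit instantiation is shown only as one possible way to generate the needed versions without writing overloads by hand; whether it avoids the error on the Mac has not been checked.

```cuda
#include <cuComplex.h>
#include <cuda_runtime.h>
#include <vector>

// Template kernel: the body is a hypothetical element-wise copy.
template <class T>
__global__ void my_kernel(cuFloatComplex* data_in, cuFloatComplex* data_out, T dim)
{
    T idx = static_cast<T>(blockIdx.x * blockDim.x + threadIdx.x);
    if (idx < dim) {
        data_out[idx] = data_in[idx];
    }
}

// Template host function that launches the template kernel.
// The template argument is passed explicitly at the launch site.
template <class T>
void my_function(cuFloatComplex* data_in, cuFloatComplex* data_out, T dim)
{
    if (dim <= 0) return;  // nothing to do, and a zero-block launch is invalid
    const unsigned int threads = 256;  // assumed block size
    const unsigned int blocks = (static_cast<unsigned int>(dim) + threads - 1) / threads;
    my_kernel<T><<<blocks, threads>>>(data_in, data_out, dim);
}

// Explicit instantiation for one type (assumed here to be int).
template void my_function<int>(cuFloatComplex*, cuFloatComplex*, int);

// Hypothetical helper: copies n values through the kernel and
// reports whether the output matches the input.
bool round_trip_matches(int n)
{
    std::vector<cuFloatComplex> host_in(n), host_out(n);
    for (int i = 0; i < n; ++i) {
        host_in[i] = make_cuFloatComplex(float(i), -float(i));
        host_out[i] = make_cuFloatComplex(0.0f, 0.0f);
    }

    const size_t bytes = n * sizeof(cuFloatComplex);
    cuFloatComplex* dev_in = 0;
    cuFloatComplex* dev_out = 0;
    if (cudaMalloc(&dev_in, bytes) != cudaSuccess) return false;
    if (cudaMalloc(&dev_out, bytes) != cudaSuccess) { cudaFree(dev_in); return false; }

    bool ok = cudaMemcpy(dev_in, &host_in[0], bytes, cudaMemcpyHostToDevice) == cudaSuccess;
    my_function<int>(dev_in, dev_out, n);
    // cudaMemcpy waits for the kernel on the default stream before copying back.
    ok = ok && cudaMemcpy(&host_out[0], dev_out, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;

    cudaFree(dev_in);
    cudaFree(dev_out);

    for (int i = 0; ok && i < n; ++i) {
        ok = cuCrealf(host_out[i]) == cuCrealf(host_in[i]) &&
             cuCimagf(host_out[i]) == cuCimagf(host_in[i]);
    }
    return ok;
}
```

Calling round_trip_matches(1000) from a host program compiled with nvcc exercises the whole path: my_function<int> launches my_kernel<int>, and the result is compared against the input on the host.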

Any hints, comments, etc would be most welcome.

Thanks,
Michael

mfatica · February 14, 2008, 3:43pm

Try adding this line at the end of your makefile:

NVCCFLAGS += --host-compilation 'C'

michael.hansen · February 14, 2008, 4:19pm

Thanks! That did the trick. What exactly does that flag do?

Cheers,

Michael
