Template function pointer

roydesbois · October 13, 2011, 1:51pm

Hi all,

Passing a function pointer to a CUDA kernel is described in detail in the SDK project FunctionPointers.

In that example, the function accepts a few unsigned char and float parameters and returns an unsigned char.

My kernel is a TEMPLATE kernel and I would like to pass a TEMPLATE function to it.

To this end, I have tried the following:

Define a template typedef for the type of function I would like to pass as a parameter (a typedef cannot be templated directly; it must be wrapped in a struct/class).

template<class T>
struct Operators
{
    typedef T (*Operator)(T);
};

Define a device template function pointer ‘d_Inverse’ of type __device__ Operators<T>::Operator:

__device__ Operators<T>::Operator d_Inverse = Inverse;

where Inverse is a host function with a number of overloaded implementations, e.g.,

int Inverse(int i){return 1/i;}
float Inverse(float f){return 1/f;}

Before the kernel launch, called within a templated “extern” function, I copy the address of ‘d_Inverse’ to a host variable ‘op’.

Operators<T>::Operator op = NULL;

cudaError_t error = cudaMemcpyFromSymbol((void*)&op, d_Inverse, (size_t)sizeof(Operators<T>::Operator));

I call my kernel with op as a parameter of type Operators<T>::Operator.

This code mimics the strategy adopted for non-templated functions. Unfortunately, it fails to compile. In particular, the declaration

Operators<T>::Operator op = NULL;

produces a compilation error, whereas the instantiated version of it

Operators<float>::Operator op = NULL;

does not.

Help is greatly appreciated!

Thanks,

Olivier

MarkusM · October 14, 2011, 8:31am

A bit more context and the exact wording of the compiler errors could be helpful.
Nevertheless, I suspect it comes down to a simple problem: there are no template function pointers, since there are no template functions per se, only templates for functions. This means you can only take the pointer of an instantiated template function, like the mentioned Operators<float>::Operator.
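
Since the exact compiler error was not posted, the following is only a guess at one likely cause. Inside a template, Operators<T>::Operator is a dependent name, and C++ requires the keyword typename in front of it; the instantiated form Operators<float>::Operator does not need it. A minimal host-only sketch of this, where applyInverse is a hypothetical helper that does not appear in the thread:

```cpp
#include <cassert>
#include <cstddef>

// Wrapper from the original post: a "templated typedef" for T (*)(T).
template <class T>
struct Operators {
    typedef T (*Operator)(T);
};

// Overloads as given in the original post.
int Inverse(int i) { return 1 / i; }
float Inverse(float f) { return 1 / f; }

// Hypothetical helper (an assumption, not part of the thread).
template <class T>
T applyInverse(T value) {
    // 'typename' is required here because Operators<T>::Operator
    // depends on the template parameter T.
    typename Operators<T>::Operator op = NULL;
    op = Inverse;  // the overload is selected from the pointer's target type
    assert(op != NULL);
    return op(value);
}
```

With this change, applyInverse<float>(2.0f) calls the float overload and applyInverse<int>(1) calls the int overload. The device-side declaration of d_Inverse is a separate issue that this sketch does not address.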

roydesbois · October 14, 2011, 3:11pm

Thanks for your answer. Indeed, the code I posted works fine if I compile it for a particular instantiation of the template T. In this case, however, the “extern” function that calls the GPU kernel must be instantiated for all the template types I would like to implement. I wanted to avoid this code redundancy…
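
One way to limit the redundancy, sketched here under assumptions, is to write the launcher body once as a template and add a one-line explicit instantiation per supported type. The name launchInverse and its signature are hypothetical, and a plain host loop stands in for the actual kernel launch so that the example compiles as ordinary C++:

```cpp
#include <cassert>
#include <cstddef>

template <class T>
struct Operators {
    typedef T (*Operator)(T);
};

int Inverse(int i) { return 1 / i; }
float Inverse(float f) { return 1 / f; }

// Hypothetical launcher, written once for all types.
// Assumption: in the CUDA version the loop would be replaced by the kernel launch.
template <class T>
void launchInverse(const T* in, T* out, std::size_t n) {
    assert(n == 0 || (in != NULL && out != NULL));
    typename Operators<T>::Operator op = Inverse;  // overload picked per T
    for (std::size_t i = 0; i < n; ++i) {
        out[i] = op(in[i]);
    }
}

// One line per supported type; the function body is not repeated.
template void launchInverse<int>(const int*, int*, std::size_t);
template void launchInverse<float>(const float*, float*, std::size_t);
```

The explicit instantiations still have to be listed by hand, but each one is a single declaration rather than a copy of the function.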
