Using a device function pointer. Problems using a pointer to a device function.

Cyril_Schmidt · June 13, 2012, 1:59pm

I am trying to make a kernel that invokes a device function via a pointer. It works well as long as the function and its caller reside in the same source (.cu) file, but breaks if they are in different files. Here is the full example code.

FuncPointer.h:

#ifndef FuncPointer_h

#define FuncPointer_h

typedef float (*op_func) (float, float);

struct FuncPointer {

	FuncPointer();

    op_func fptr;

};

#endif // FuncPointer_h

Main.cu:

#include <cstdio>

#include "FuncPointer.h"

/// start of FuncPointer.cu

__device__ float add_func (float x, float y)

{

    return x + y;

}

__device__ op_func func = add_func;

FuncPointer::FuncPointer() {

    cudaMemcpyFromSymbol(&fptr, func, sizeof(func));

}

/// end of FuncPointer.cu

__global__ void kernel (FuncPointer* p)

{

    float x=100, y=10, result=0;

    result = p->fptr(x, y);

    printf ("result = %f\n", result);

}

int main () 

{

    FuncPointer fp;

	FuncPointer* dev_fp;

	cudaMalloc(&dev_fp, sizeof(FuncPointer));

	cudaMemcpy(dev_fp, &fp, 

		sizeof(FuncPointer), cudaMemcpyHostToDevice);

	

	kernel<<<1,1>>>(dev_fp);

	cudaFree(dev_fp);

return EXIT_SUCCESS;

}

This works as expected.

Note the code section between [font=“Courier New”]/// start of FuncPointer.cu[/font] and [font=“Courier New”]/// end of FuncPointer.cu[/font].

If I move this code from Main.cu into another file FuncPointer.cu and link them together, the execution stops with the “unspecified launch error” message.

What is wrong with calling a device function from another file by pointer?

A similar question was asked in this post, but never answered.

njuffa · June 13, 2012, 6:26pm

Calling a device function defined in a different compilation unit requires linking of device code, so references to other compilation units can be resolved. Up to and including CUDA 4.2, there is no support for linking of device code. In this case the function pointer is a device variable not accessible from outside the compilation unit it is defined in.

As was announced at GTC, CUDA 5.0 will provide for (static) linking of device code. See for example this presentation by our chief technologist for GPU computing, Mark Harris: http://developer.download.nvidia.com/GTC/PDF/GTC2012/PresentationPDF/S0641-GTC2012-CUDA-5-Beyond.pdf

A CUDA 5.0 preview is available to registered developers. I believe (but have not checked) that the new linker is part of the preview. Please note that the stability and maturity of the preview should not be assume to be on par with that of release candidates.

Cyril_Schmidt · June 15, 2012, 6:57am

Thanks for the explanation; I will try CUDA 5.0 preview out!

BTW, I see the same problem if I try to invoke a virtual function that is defined in a different compilation unit. For the same reason, obviously.

Topic		Replies	Views
Consistency of functions pointer CUDA Programming and Performance	5	3068	June 21, 2013
__device__ CUDA Programming and Performance	7	3842	December 12, 2011
external calls to __device__ functions CUDA Programming and Performance	4	4936	July 20, 2010
A pointer to a function CUDA Programming and Performance	7	1337	May 13, 2016
How to separate device function and kernel function? CUDA Programming and Performance	2	1554	November 22, 2009
Device Function Library How to make a lib of device functions CUDA Programming and Performance	6	4859	June 24, 2009
How can I use __device__ function pointer in CUDA ? CUDA Programming and Performance	34	60194	June 3, 2020
Is this correct way to code function pointers? CUDA Programming and Performance	4	2455	March 12, 2009
Multiple definition error on device function in header file CUDA Programming and Performance	4	25253	March 24, 2011
__device__ functions CUDA Programming and Performance	9	3113	November 10, 2010

Using a __device__ function pointer. Problems using a pointer to a __device__ function.

Related topics

Using a device function pointer. Problems using a pointer to a device function.