How to copy a host function pointer to device in CUDA

Narrow · June 15, 2022, 2:04pm

Hello everyone.
I recently started porting a cpp matrix library to cuda code in order to accelerate the matrix operations.

in the old cpu-based code, i had a function in the matrix class, .apply(double (*func)(double)), that applied a function to each element of the matrix;

Matrix Matrix::apply(double (*func)(double)) const
{
    Matrix applied(this->rows, this->cols);

    for (int i = 0; i < this->rows; i++)
        for (int j = 0; j < this->cols; j++)
            applied.data[i][j] = func(this->data[i][j]);
    
    return applied;
}

Now, how would i be able to sort of “copy” the function pointer from host to device, in order to give it to a kernel that does the exact same thing only parallelized between multiple threads like this:

void applyWrapper(double** A, double (*hostFuncPtr)(double), size_t rows, size_t cols
{
     auto deviceFuncPtr = /* copy the function pointed by hostFuncPtr into a device function pointer */
     applyKernel<<<1, 1>>>(A, deviceFuncPtr, rows, cols);
}

void applyKernel(double** A, double (*devFuncPtr)(double), size_t rows, size_t cols)
{
    size_t idx = threadIdx.x;
    size_t stride = blockDim.x;

    for (size_t i = idx; i < A_cols; i += stride)
        for (size_t j = 0; j < A_rows; j++)
            A[j][i] = func(A[j][i]);
}

TomNVIDIA · June 15, 2022, 3:10pm

Hi,

This is the Networking category, this issue should be posted in one of the CUDA forums. I will move it over for you.

Robert_Crovella · June 15, 2022, 7:20pm

cuda sample codes include a sample demonstrating function pointer usage

any function pointer you want to use in device code had better be obtained from a properly decorated device function. You cannot pass a pointer that points to a host function and expect to do anything sensible with it in device code.

The cuda sample code will make this evident.

jccporter · July 7, 2024, 4:33pm

Would you kindly be able to provide a link to this sample code?

rs277 · July 7, 2024, 6:45pm

Curefab · July 8, 2024, 4:38pm

If the function pointer at some point is known at compile-time, you could create a template parameter. For example replace the function pointer with a class type with a suitable member function.

warpdivergence77 · July 10, 2024, 12:37pm

You can’t pass a pointer to host function to the device, the address spaces are separate. Trying to call that in device code would be nonsensical.

Topic		Replies	Views
Warp Invalid PC, device function pointer CUDA Programming and Performance	4	1128	May 29, 2019
Function pointers crashing kernel calls CUDA Programming and Performance	1	2938	August 8, 2011
A pointer to a function CUDA Programming and Performance	7	1451	May 13, 2016
Array of function pointers assignment CUDA Programming and Performance	6	1154	March 23, 2022
How can I use __device__ function pointer in CUDA ? CUDA Programming and Performance	34	60914	June 3, 2020
pointer as function parameters CUDA Programming and Performance	1	939	September 11, 2009
device function pointers CUDA Programming and Performance	0	578	January 25, 2012
Is this correct way to code function pointers? CUDA Programming and Performance	4	2540	March 12, 2009
How to access kernel modified data on host CUDA Programming and Performance	7	5393	March 31, 2009
Passing a function ptr CUDA Programming and Performance cuda	2	611	April 14, 2021

How to copy a host function pointer to device in CUDA

Related topics