fortran cuda interface passing pointer from fortran and allocating memory on device

nachikets · May 14, 2010, 7:03am

Hello,
I intend to pass a pointer to CUDA code and have it allocate a chunk of memory on the device. My code follows. It compiles fine, but the output is incorrect. Can anyone help me out?
FORTRAN code:

PROGRAM memtest
USE iso_c_binding
INTERFACE
SUBROUTINE get_in(x,n) BIND(C,name="get_in")
USE iso_c_binding
TYPE(C_PTR) :: x
INTEGER(C_INT),VALUE :: n
END SUBROUTINE
END INTERFACE
TYPE(C_PTR) :: px
INTEGER,POINTER :: x(:)
INTEGER(C_INT) :: n=10
CALL get_in(px,n)
END PROGRAM

CUDA-Program in C
#include<stdlib.h>
#include<stdio.h>

extern "C" void get_in(int **x,int n)
{

int *y=(int*)calloc(n,sizeof(int));
y[0]=10;
x=(int**)malloc(sizeof(int*));

cudaMalloc((void**)x,n*sizeof(int));
cudaMemcpy(x[0],y,n*sizeof(int),cudaMemcpyHostToDevice);
y[0]=11;

printf("\ny0=%d",y[0]);
cudaMemcpy(y,x[0],n*sizeof(int),cudaMemcpyDeviceToHost);
printf("\ny0=%d",y[0]);

}
compilers used: gcc 4.3.3
nvcc: release 3.0, V0.2.1221
Ideal output:
y0=11
y0=10

Actual output
y0=10
y10=10

Can anyone point out the bug in my code?

thanks and regards,
Nachiket

avidday · May 14, 2010, 7:36am

All arguments in Fortran code are passed as pointers, so any C/C++ code called by Fortran must also have pointer arguments, not pass-by-value or pass-by-reference. It means that the CUDA calls are probably failing on the n value you pass, but you have no error checking and don’t see it.

nachikets · May 14, 2010, 7:45am

Yes, but gcc-4.3.3 supports passing arguments by value as well as by reference in Fortran. If you look at the interface, the arguments of get_in are declared as C_PTR and C_INT. This is called C interoperability, a feature added in Fortran 2003. If I allocate memory to the pointer in plain C code instead of CUDA code, this program runs absolutely fine.
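
For readers following along, the C side of the two conventions discussed here might look like the sketch below. It assumes the interface from the first post (TYPE(C_PTR) :: x without VALUE, and INTEGER(C_INT),VALUE :: n). The names get_in_value and get_in_ref are made up for illustration, and host calloc stands in for the device allocation so the sketch runs without a GPU.

```c
#include <stdlib.h>

/* Matches a BIND(C) interface where n is declared
   INTEGER(C_INT), VALUE: the integer arrives by value.
   TYPE(C_PTR) without VALUE arrives as a pointer to a pointer. */
void get_in_value(int **x, int n)
{
    *x = (int *)calloc((size_t)n, sizeof(int));
    if (*x != NULL && n > 0)
        (*x)[0] = n;   /* marker value so the caller can check the result */
}

/* Matches the traditional convention (no VALUE attribute):
   every argument, including n, arrives as an address. */
void get_in_ref(int **x, int *n)
{
    get_in_value(x, *n);
}
```

If the Fortran interface and the C prototype disagree about which of these two shapes is in use, the C side would read an address as a count (or the reverse), which is the kind of failure the previous reply was worried about.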

Btw. there is a typo in what I have written:

Actual output:

y0=11

avidday · May 14, 2010, 8:16am

Indeed, it seems the C interop stuff actually works in gfortran 4.3 (it certainly didn’t work well the last time I tried it, which was circa gfortran 4.1 or whatever Redhat shipped with ES5.1).

This worked for me:

avidday@cuda:~/code/fortrancuda$ gcc -c -I$CUDA_INSTALL_PATH/include fcuda.c -o fcuda.o
avidday@cuda:~/code/fortrancuda$ gfortran -o memtest memtest.f90 fcuda.o -L/opt/cuda-3.0/lib64/ -lcudart
avidday@cuda:~/code/fortrancuda$ ./memtest
y[0](before)=11
y[0](after)=10
avidday@cuda:~/code/fortrancuda$ cat fcuda.c
#include <stdio.h>
#include <stdlib.h>
#include <assert.h>
#include "cuda_runtime.h"

#ifndef gpuAssert
#include <stdio.h>
#define gpuAssert( condition ) { if( (condition) != 0 ) { fprintf( stderr, "\n FAILURE %s in %s, line %d\n", cudaGetErrorString(condition), __FILE__, __LINE__ ); exit( 1 ); } }
#endif

void get_in(int **x,int n)
{
	int *y=(int*)calloc(n,sizeof(int));
	y[0]=10;
	gpuAssert( cudaMalloc((void**)x,n*sizeof(int)) );
	gpuAssert( cudaMemcpy(*x,y,n*sizeof(int),cudaMemcpyHostToDevice) );
	y[0]=11;
	printf("y[0](before)=%d\n",y[0]);
	gpuAssert( cudaMemcpy(y,*x,n*sizeof(int),cudaMemcpyDeviceToHost) );
	printf("y[0](after)=%d\n",y[0]);
	free(y);
}
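
Note that this version writes the device pointer through x instead of reassigning x, as the code in the first post did with x=(int**)malloc(...). A host-only sketch of why that matters is below. The names set_local and set_caller are hypothetical, and calloc stands in for cudaMalloc so it runs without a GPU.

```c
#include <stdlib.h>

/* Reassigning the parameter only changes the local copy of x,
   so the caller's pointer is never updated. */
void set_local(int **x, int n)
{
    x = (int **)malloc(sizeof(int *));
    if (x == NULL)
        return;
    *x = (int *)calloc((size_t)n, sizeof(int));
    free(*x);   /* released here only to keep the sketch leak-free */
    free(x);
}

/* Writing through x stores the new address in the caller's variable. */
void set_caller(int **x, int n)
{
    *x = (int *)calloc((size_t)n, sizeof(int));
}
```

With the first form, the allocation can still succeed and be used inside the function, but the Fortran variable px would never receive the address.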

nachikets · May 14, 2010, 10:08am

Hello,

I compiled your code successfully but failed to run it. I received the following error:
FAILURE CUDA version is insufficient for CUDART version in alloc_mem.cu, line 43

Also I have one more query:

How did you manage to compile without prefixing the function get_in with extern "C"? For me it did not work.

thanks and regards,
Nachiket

avidday · May 14, 2010, 11:22am

The driver you are using is too old for the CUDA version you have. For CUDA 3.0 you must use a 195 series driver on Linux.

As you can see, I compiled the function containing the CUDA calls as plain C using gcc. There is no device code, so there is no need to use nvcc in this case. nvcc compiles host code as C++ by default, although this can be changed from the command line if you don’t want host code compiled as C++.
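
One common way to keep a single source file usable from both a C compiler and a C++ compiler is to guard the linkage specification, so the symbol stays unmangled either way. A minimal sketch, using a hypothetical function add_one rather than the get_in routine from this thread:

```c
/* When compiled as C++ (nvcc's default for host code), extern "C"
   disables name mangling so a Fortran BIND(C,name="add_one")
   interface can find the symbol. When compiled as C, the guard
   expands to nothing and the function is an ordinary C function. */
#ifdef __cplusplus
extern "C" {
#endif

int add_one(int n)
{
    return n + 1;
}

#ifdef __cplusplus
}
#endif
```

This explains the difference seen above: gcc compiling a .c file never mangles the name, so no extern "C" is needed there, while a .cu file built with nvcc does need it.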

nachikets · May 14, 2010, 12:25pm

Hello,

My machine has a Tesla card. I am not sure whether the 195 series driver is appropriate for it. Anyway, thanks for the reply.

avidday · May 14, 2010, 2:34pm

On Linux, the driver is common between consumer GeForce and Tesla cards. You won’t get this to work unless the SDK and driver versions match. Either downgrade to an older SDK version or upgrade the driver.
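
CUDA reports versions as integers of the form 1000*major + 10*minor (3000 for CUDA 3.0), and the version supported by the driver has to be at least the version of the runtime library. A sketch of that comparison follows. The helper names are hypothetical, and in a real program the two integers would come from cudaDriverGetVersion and cudaRuntimeGetVersion rather than being passed in by hand.

```c
/* Decode a CUDA version integer (1000*major + 10*minor). */
int cuda_major(int version) { return version / 1000; }
int cuda_minor(int version) { return (version % 1000) / 10; }

/* Returns 1 if the driver supports a CUDA version at least as new
   as the runtime library, 0 otherwise. A 0 here corresponds to the
   "CUDA version is insufficient for CUDART version" failure. */
int driver_is_sufficient(int driver_version, int runtime_version)
{
    return driver_version >= runtime_version;
}
```

For example, a driver reporting 2030 (CUDA 2.3) paired with a 3000 runtime would fail this check, which is consistent with the advice to either upgrade the driver or downgrade the toolkit.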

nachikets · May 14, 2010, 5:19pm

Okay. Will try this solution. Thanks!
