Cannot allocate variables from used modules

cknaus1 · November 9, 2010, 3:37pm

Hi,

Here is a test case where I allocate a device variable which is declared in a separate module.

--8<-- compile with pgf90 -Mcuda mod1.cuf kernel.cuf main.f90

PROGRAM main
USE test_mod
CALL test
END PROGRAM

–

MODULE mod1
IMPLICIT NONE
INTEGER, DEVICE, ALLOCATABLE :: foo
CONTAINS
END MODULE

–

module test_mod
use cudafor
use mod1 ! this fails
! INTEGER, DEVICE, ALLOCATABLE :: foo ! this works
contains

attributes(global) subroutine test_kernel( )
foo = 42
end subroutine

subroutine test
REAL*8 temp
integer r
type(dim3) :: dimGrid, dimBlock

allocate(foo)

dimGrid = dim3(1, 1, 1)
dimBlock = dim3(1, 1, 1)
call test_kernel<<<dimGrid,dimBlock>>>()

r = cudathreadsynchronize()
write(0,*) "Value of cudathreadsynchronize = ", r

temp = foo
write(0,*) temp

end subroutine
end module

--8<--

When I run the program, it produces the following output:

–
Value of cudathreadsynchronize = 4
copyout Memcpy (host=0x7ffffa7af254, dev=0x110000, size=4) FAILED:4

However, if I declare the device variable inside the same module where it is allocated, then the test succeeds. My system configuration is: Ubuntu 10.04 (64-bit), GeForce GTX 260, original NVIDIA driver packaged by Ubuntu, PGI Accelerator Fortran Workstation 10.5.

Is this a compiler bug? If so, is there a workaround that still lets me allocate variables declared in separate modules?

Regards,

Claude Knaus

MatColgrove · November 9, 2010, 9:16pm

Hi Claude,

In CUDA Fortran, device module data is only accessible by device routines within the same module or from host code that uses the module. Accessing device data declared in another module from device code is not allowed. (See the “Variable Qualifier” section of the CUDA Fortran Reference Guide.) The problem is that there isn’t a linker for device code, so there is no way to associate external device symbols.

However, this has been our most requested feature and our engineers have been hard at work trying to find a way to support this. With Fermi and CUDA 3.0, we have found a way to perform this association at run time. The feature is still in development but will be available some time early next year. Full details can be found at Account Login | PGI
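In the meantime, the layout that stays within this restriction is the one already marked "this works" in the test case: declare the device variable in the same module as the kernel that touches it. A minimal sketch of that arrangement, reusing the names from the original post (the deallocate call and the integer host variable are additions for illustration):

```fortran
module test_mod
  use cudafor
  implicit none
  ! Device data lives in the same module as the kernel that uses it,
  ! so no cross-module device symbol has to be resolved.
  integer, device, allocatable :: foo
contains

  attributes(global) subroutine test_kernel()
    foo = 42
  end subroutine test_kernel

  subroutine test
    integer :: r, temp
    type(dim3) :: dimGrid, dimBlock

    allocate(foo)

    dimGrid = dim3(1, 1, 1)
    dimBlock = dim3(1, 1, 1)
    call test_kernel<<<dimGrid, dimBlock>>>()

    ! Same synchronization call as in the original test case.
    r = cudathreadsynchronize()
    write(0,*) "Value of cudathreadsynchronize = ", r

    ! Assignment from a device variable copies the value back to the host.
    temp = foo
    write(0,*) temp

    deallocate(foo)
  end subroutine test
end module test_mod
```

Host code in other files can still `use test_mod` and call `test`; only device routines are limited to device data from their own module.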

Mat

cknaus1 · November 9, 2010, 11:21pm

Hi Mat,

Thanks for the prompt and clarifying answer!

Cheers,
– Claude
