I’m developing a Fortran GPU code with mix usage of cuda Fortran and OpenACC on Windows 10 with PGI 19.10 community version. All my data arrays are allocated with cuda fortran allocate statement. The allocatable arrays are defined in a separated module. The code compiles and runs with correct results. But today after a rebuild, suddenly the code cannot run and gives me following error.
0: ALLOCATE: copyin Symbol Memcpy FAILED:13(invalid device symbol)
I’m relatively new to cuda fortran and OpenACC. I don’t know what this error means and how to fix it.
My GPU is Tesla K40. The compiler options I used is:
F90FLAGS =-g -acc -ta=tesla:cuda10.1 -Minfo=accel
CUDAFLAGS = -Mcuda=cc35,rdc -Mcudalib=cublas
Any help and advice will be deeply appreicated.