Fortran OpenACC code compiles, but does not use the device

e.moravveji · September 3, 2017, 4:23pm

Hi.

I am experimenting with basic OpenACC Fortran features. I have prepared a very simple example of computing a 2D Gaussian surface using derived types, and trying to transfer data between the host and the device. This example is modular and is distributed across multiple files on purpose (in contrast to all example codes I have seen so far). You may find the source code and the Makefile here: OpenACC/derived_types at master · moravveji/OpenACC · GitHub

The problem is: the code neatly compiles and runs. However, I never see the expected compiler feedback talking about data movements, and the generation of device kernel codes. Thus, the code is only running on the host, ignoring all !$acc directives. See below:

$> make clean; make
rm -f *.mod *.o *~ *.exe
pgfortran -c  -o io.o io.f90
pgfortran -c  -o vars.o vars.f90
pgfortran -c  -o kern.o kern.f90
pgfortran -c  -o main.o main.f90
pgfortran -acc -ta=tesla:cuda8.0,cc35 -o drv_types.exe io.o vars.o kern.o main.o -Minfo=all 

./drv_types.exe

I have two guesses: (1) either I am messing something up in my Makefile or any of the Fortran modules (which I cannot spot readily), or (2) there are yet additional compiler flags to set when the OpenACC directives are used across one/multiple modules/source files. Or perhaps something else is the reason.

I use PGI-17.4 with K40c NVIDIA device on a Westmere node. The following environment variables are also set for extra feedback:

export ACC_DEVICE_TYPE=nvidia
export PGI_ACC_NOTIFY=1
export PGI_ACC_TIME=1

I would be glad if some one of the accelerator black belts point out how to fix this compilation issue.

Regards,
Ehsan.

MatColgrove · September 5, 2017, 2:44pm

Hi Ehsan,

It looks like that you only have the flags that enable OpenACC on your link line, not the compilation. Hence, the compiler is ignoring the OpenACC directives. Try adding “-acc -ta=tesla:cuda8.0,cc35 -Minfo=accel” to your compilation flags in your Makefile.

Hope this helps,
Mat

e.moravveji · September 5, 2017, 6:20pm

Thanks Mat,

Problem solved.

Regards,
Ehsan.

Topic		Replies	Views
Problem with fixed-form Fortran OpenACC, daxpy Legacy PGI Compilers	2	2590	December 10, 2015
Compiling for Both OpenACC and CUDA Fortran Legacy PGI Compilers	4	7200	September 11, 2014
pgc++ -c -acc failed to compile with -O2 Legacy PGI Compilers	2	2542	August 26, 2019
Device Debugging with Allinea DDT Legacy PGI Compilers	6	7054	August 7, 2015
Strange error Legacy PGI Compilers	3	910	July 6, 2020
OpenACC with cuBLAS and cuSPARSE in Fortran code Legacy PGI Compilers	7	8443	February 22, 2016
how to compil CUDA device functions Legacy PGI Compilers	10	5024	August 29, 2018
Issue with acc_memcpy_device Legacy PGI Compilers	3	2144	August 19, 2019
NVFORTRAN-S-0155-Could not resolve generic procedure for cublas nvc, nvc++ and nvfortran	1	34	July 28, 2024
simple multi-gpu test program not working Legacy PGI Compilers	4	4093	June 14, 2013

Fortran OpenACC code compiles, but does not use the device

Related topics