I have a double precision pgfortran code which is accelerated (mainly) by openacc directives. In order to check performance for single precision (SP) I changed in the code
the definition for kind (1.e0) and added to the following flags to the compilation command:
-pc 32 -r4 -Mfprelaxed
Is it correct?
Now, when I run the SP version I get NaN’s, but I don’t know how to do debugging for the GPU