Mat (et al),
I just noticed that if I use -Mcuda and -Kieee together with 10.5, it works…or at least doesn’t complain like it did with 10.4 and below:
> pgfortran -V10.4 -Mcuda=keepgpu,keepbin,keepptx,fastmath,nofma -Kieee -fast -r4 -Mextend -Mpreprocess -Ktrap=fp -DFLXY -DDEG2 -c src/sorad.FLXY.play.cuf sorad.FLXY.play.001.gpu(252): error: expected a ")" ... cut a lot of these errors... sorad.FLXY.play.001.gpu(1176): error: expected a ")" 25 errors detected in the compilation of "/tmp/pgnvdddXgjVTRQ3JW.nv0". PGF90-F-0000-Internal compiler error. pgnvd job exited with nonzero status code 0 (src/sorad.FLXY.play.cuf: 1251) PGF90/x86-64 Linux 10.4-0: compilation aborted > pgfortran -V10.5 -Mcuda=keepgpu,keepbin,keepptx,fastmath,nofma -Kieee -fast -r4 -Mextend -Mpreprocess -Ktrap=fp -DFLXY -DDEG2 -c src/sorad.FLXY.play.cuf
Does this mean that -Kieee flag affects the CUDA code, or does it mean that host-code math contained within a .CUF can now be IEEE?
Or, should one even do this? Mix -Kieee with -Mcuda?
ETA: I just realized I have fastmath and Kieee and nofma. I think I’d need a flowchart to figure out what the different combinations of all those do…and if any of them are recommended or warned against!