Nvfortran (v22.11) + OpenACC giving inconsistent results with -O2 and -O3: How to best triangulate?

user94452 · August 5, 2023, 9:01pm

I’ve noticed that nvfortran with OpenACC produces unexpected artifacts in my simulations at -O3 but not at -O2. The code is too long to track down the root of this difference by hand; we have dozens of OpenACC kernels. Is there a clean way to bisect where the issue is coming from?

Right now, we are compiling at -O2 and manually enabling additional options such as -Munroll, but I’m not sure every difference between -O2 and -O3 can be toggled via a flag (or is even documented, though some are).

bleback · August 6, 2023, 5:23pm

You might try PCAST (Parallel Compiler Assisted Software Testing), either comparing GPU results to the CPU, or saving results at -O2 and comparing them to -O3. See the HPC Compilers User's Guide, Version 23.7 (for ARM, OpenPower, x86).
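To make the "save at -O2, compare at -O3" idea concrete, here is a minimal Python sketch of the comparison step only. It is not PCAST's API or file format: the function `first_divergence`, the checkpoint names `flux` and `pressure`, and the sample values are all hypothetical, and it assumes you have already dumped the same arrays, in the same order, from both builds.

```python
import math

def first_divergence(reference, candidate, rel_tol=1e-12, abs_tol=0.0):
    """Return (name, index, ref_value, cand_value) for the first element
    that differs beyond the tolerance, or None if everything matches.

    `reference` and `candidate` map a checkpoint name to a list of floats;
    checkpoints are visited in insertion order (the order they were saved).
    """
    for name, ref_values in reference.items():
        cand_values = candidate[name]
        for index, (r, c) in enumerate(zip(ref_values, cand_values)):
            if not math.isclose(r, c, rel_tol=rel_tol, abs_tol=abs_tol):
                return name, index, r, c
    return None

# Hypothetical checkpoints dumped after two kernels in each build.
o2_run = {"flux": [1.0, 2.0], "pressure": [3.0, 4.0]}
o3_run = {"flux": [1.0, 2.0], "pressure": [3.0, 4.0001]}

result = first_divergence(o2_run, o3_run, rel_tol=1e-9)
print(result)
```

Reporting the first checkpoint that exceeds the tolerance points at the earliest kernel worth inspecting; the tolerance itself is a judgment call and depends on the precision your simulation needs.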

It is likely the order of operations changing, either due to compiler optimizations or unrolling, which might also affect the order of summations. But there are other possibilities, including bugs. You can also experiment with the -gpu options to narrow it down.
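As a small illustration of why summation order matters, the Python sketch below adds the same four values once with a single accumulator and once with two interleaved partial sums, which is roughly what an unrolled loop does. The function names and the input values are made up for the demonstration, and this says nothing about what nvfortran actually generates at -O3.

```python
def sequential_sum(values):
    """Add the values left to right into a single accumulator."""
    total = 0.0
    for v in values:
        total += v
    return total

def unrolled_sum(values, lanes=2):
    """Accumulate into `lanes` interleaved partial sums, then combine them,
    mimicking the reassociation an unrolled loop can introduce."""
    partial = [0.0] * lanes
    for i, v in enumerate(values):
        partial[i % lanes] += v
    total = 0.0
    for p in partial:
        total += p
    return total

# Values chosen so that 1.0 is lost when added directly to 1e16 in double precision.
values = [1e16, 1.0, -1e16, 1.0]

print(sequential_sum(values))         # 1.0
print(unrolled_sum(values, lanes=2))  # 2.0
```

Both results are legitimate floating-point evaluations of the same sum, so a difference between -O2 and -O3 is not by itself proof of a compiler bug.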

user94452 · August 6, 2023, 5:28pm

Thanks, @bleback - will report back on what we find most useful and if we suspect bugs.
