Strange -O3 optimization result for nvfortran

JieyunPan · April 27, 2021, 2:59am

I was porting an in-house fortran code to OpenACC by HPC SDK and got some strange result for -O3 optimization. The result can be reproduced by the following piece of code:

! bug_test.f90

program main

implicit none

integer i, na

real, allocatable :: w(:), ww(:)

real a

na = 8

allocate(w(na), ww(na))

w = 1.

ww = -1.

!$acc kernels

do i = 1, na

a = w(i)

w(i) = ww(i)

ww(i) = a

enddo

!$acc end kernels

write(*, *) w

write(*, *)

write(*, *) ww

end program

I need to sweep the elements in two arrays, when I compile the code with:

nvfortran -acc -Minfo -r8 -O3 bug_test.f90

The program output shows that both w and ww all “-1”

However, the program works well when compile with -O2 or -O.

The program even shows a right output for -O3 optimization while I targeting -acc=multicore or -acc=host. It seems that the compiler takes a strange optimization strategy for the GPU code. I have to avoid -O3 optimization in my code now.

My OS is Ubuntu 18.04.2 LTS, HPC SDK version is 21.3, cuda version is 11.0.

Best regards

MatColgrove · April 27, 2021, 7:31pm

Thanks JieyunPan,

I’ve reproduced the issue here and filed problem report TPR #29984.

-Mat

bleback · July 22, 2021, 10:41pm

This has been fixed in our 21.7 release which has just been made available.

Topic		Replies	Views
Possible NVFORTRAN optimization bug nvc, nvc++ and nvfortran nvbugs	2	482	April 20, 2021
Nvfortran (v22.11) + OpenACC giving inconsistent results with -O2 and -O3: How to best triangulate? nvc, nvc++ and nvfortran	2	377	August 6, 2023
Bug of nvfortran 22.2-0: array subscript triplet handled wrongly nvc, nvc++ and nvfortran nvbugs	4	983	June 9, 2022
Segfault with nvfortran nvc, nvc++ and nvfortran	2	302	March 30, 2024
PGF90- Internal compiler error Legacy PGI Compilers	1	2207	October 13, 2017
Fortran OpenACC program compiled with nvfortran -O2 crashes, but -O1 works nvc, nvc++ and nvfortran	3	348	December 13, 2023
SOLVED? nvcc optimization options problem CUDA Programming and Performance	5	7130	July 15, 2010
Disabling optimization on specific source files (nvc++) nvc, nvc++ and nvfortran	4	529	September 1, 2023
incorrect exponentiation result Legacy PGI Compilers	2	3551	June 19, 2008
[nvfortran] Assumed sized character variables inside OpenACC kernels regions nvc, nvc++ and nvfortran	3	327	November 2, 2023

Strange -O3 optimization result for nvfortran

Related topics