A question from compiling: fort2 TERMINATED by signal 6

SkyCool · April 8, 2026, 1:19pm

Hi,

I’m using NVFortran with OpenACC and experiencing very slow compilation times. When I use !$acc routine seq on functions that are called frequently within kernels, the compiler takes several minutes to finish, even though the code compiles successfully without errors. I have attached a picture to illustrate this.

Additionally, I encountered an error:

corrupted size vs. prev_size
nvfortran-Fatal-/opt/nvidia/hpc_sdk/Linux_x86_64/25.7/compilers/bin/tools/fort2 TERMINATED by signal 6

This occurred after a long wait. I found that changing -fast to -O2 in the Makefile allows the compilation to succeed, but it still takes a long time.

Thanks!

MatColgrove · April 8, 2026, 3:33pm

Hi SkyCool,

Do you have a reproducing example you can share so I can investigate?

From your description, it seems the compiler is using a lot of memory, which would cause the slow compilation and possibly the abort signal. This may be due to inlining, but I can’t be sure.

I only see one similar issue report, but that was fixed back in 2020. Might be related, but I’d need to reproduce the issue to tell.

Try adding “-Mnoinline” to see if that helps. Also the flag “-time” will have the compiler driver print out the time for each phase of the compilation so we can see where the time is coming from.

Also, you can try using our latest compiler release, 26.3, to see if that helps.

-Mat

SkyCool · April 9, 2026, 2:23am

Dear Mat,

Thanks for your replies. I added your suggestions to my Makefile, and now the compiler works, but the compilation speed has not improved. After adding “-time”, I obtained timing statistics for the two SUBROUTINEs that contain the FUNCTION mentioned above.

I am sorry that I cannot provide the complete code, because I cannot separate it from the project. But I can share part of the KERNEL:

!$acc kernels async(11)
!$acc loop private(x_pos,y_pos,z_pos,idx_num,idx1_num,idx2_num,idx3_num,idx4_num,&
!$acc              Forward_loop,Ratio_Rec_IP_1,Ratio_Rec_IP_2,Ratio_Rec_IP_3,&
!$acc              Ratio_Rec_IP_4,Ratio_Rec_IP_5,TEMP_SIG,TEMP_IPM,TEMP_IPT,&
!$acc              TEMP_IPC,DELX1,DELY1,DELZ1)

DO I=1,Model_Num
    !! Calculate the first term of gradient
    val_temp=0.0D0
    
    !$acc loop seq
    DO jj=1,4
        idx_num  = Field_Idx_Rec(I,jj)
        idx1_num = Field_Idx1_Rec(I,jj)
        idx2_num = Field_Idx2_Rec(I,jj)
        idx3_num = Field_Idx3_Rec(I,jj)
        idx4_num = Field_Idx4_Rec(I,jj)

        x_pos = Field_Coord(idx_num)%Coordmesh_X
        y_pos = Field_Coord(idx_num)%Coordmesh_Y
        z_pos = Field_Coord(idx_num)%Coordmesh_Z

        DELY1 = (CDELY(y_pos-1)+CDELY(y_pos))/2.0D0
        DELZ1 = (CDELZ(z_pos-1)+CDELZ(z_pos))/2.0D0

        TEMP_SIG = CCSIG(x_pos,y_pos-1,z_pos-1)*CDELY(y_pos-1)*CDELZ(z_pos-1)&
                 &+CCSIG(x_pos,y_pos-1,z_pos  )*CDELY(y_pos-1)*CDELZ(z_pos  )&
                 &+CCSIG(x_pos,y_pos  ,z_pos-1)*CDELY(y_pos  )*CDELZ(z_pos-1)&
                 &+CCSIG(x_pos,y_pos  ,z_pos  )*CDELY(y_pos  )*CDELZ(z_pos  )

        TEMP_IPM = IPM(x_pos,y_pos-1,z_pos-1)*CDELY(y_pos-1)*CDELZ(z_pos-1)&
                 &+IPM(x_pos,y_pos-1,z_pos  )*CDELY(y_pos-1)*CDELZ(z_pos  )&
                 &+IPM(x_pos,y_pos  ,z_pos-1)*CDELY(y_pos  )*CDELZ(z_pos-1)&
                 &+IPM(x_pos,y_pos  ,z_pos  )*CDELY(y_pos  )*CDELZ(z_pos  )

        TEMP_IPT = IPT(x_pos,y_pos-1,z_pos-1)*CDELY(y_pos-1)*CDELZ(z_pos-1)&
                 &+IPT(x_pos,y_pos-1,z_pos  )*CDELY(y_pos-1)*CDELZ(z_pos  )&
                 &+IPT(x_pos,y_pos  ,z_pos-1)*CDELY(y_pos  )*CDELZ(z_pos-1)&
                 &+IPT(x_pos,y_pos  ,z_pos  )*CDELY(y_pos  )*CDELZ(z_pos  )

        TEMP_IPC = IPC(x_pos,y_pos-1,z_pos-1)*CDELY(y_pos-1)*CDELZ(z_pos-1)&
                 &+IPC(x_pos,y_pos-1,z_pos  )*CDELY(y_pos-1)*CDELZ(z_pos  )&
                 &+IPC(x_pos,y_pos  ,z_pos-1)*CDELY(y_pos  )*CDELZ(z_pos-1)&
                 &+IPC(x_pos,y_pos  ,z_pos  )*CDELY(y_pos  )*CDELZ(z_pos  )

        TEMP_SIG = TEMP_SIG/(4.0D0*DELY1*DELZ1)
        TEMP_IPM = TEMP_IPM/(4.0D0*DELY1*DELZ1)
        TEMP_IPT = TEMP_IPT/(4.0D0*DELY1*DELZ1)
        TEMP_IPC = TEMP_IPC/(4.0D0*DELY1*DELZ1)

        Forward_loop = Loop-1

        ! Complex expressions for Ratio_Rec_IP_1 to Ratio_Rec_IP_5
        ! (involving GAMMA, digamma, log functions)
        ...

        val_temp = val_temp + Ratio_Rec(I,jj)&
                 &*(Ratio_Rec_IP_1*Forward_fields(idx_num, Loop-Field_Rec_Idxstart+1)&
                 & -Ratio_Rec_IP_2*Forward_fields(idx1_num,Loop-Field_Rec_Idxstart+1-1)&
                 & -Ratio_Rec_IP_3*Forward_fields(idx2_num,Loop-Field_Rec_Idxstart+1-1)&
                 & +Ratio_Rec_IP_4*Forward_fields(idx3_num,Loop-Field_Rec_Idxstart+1-1)&
                 & +Ratio_Rec_IP_5*Forward_fields(idx4_num,Loop-Field_Rec_Idxstart+1-1))&
                 &*Field_False_Rec(idx_num)
    ENDDO

    !$acc loop seq
    DO jj=5,8
        ! Similar structure with DELX1, DELZ1
        ...
    ENDDO
ENDDO
!$acc end kernels

In my MODULE, there are six SUBROUTINEs. Each SUBROUTINE contains a similarly complex formula that calls the FUNCTION.

The function “digamma” is defined as:

function digamma(c) result(psi)
    ! Truncated series: psi(c) = -gamma_e - 1/c + sum_{i=1}^{N} c/(i*(i+c))
    implicit none
    !$acc routine seq
    real(8), intent(in) :: c
    real(8) :: psi
    real(8), parameter :: gamma_e = 0.5772156649015328606d0
    integer :: i, N
    real(8) :: x, sum

    ! if (c <= 0.d0 .or. c >= 1.d0) then
    !     write(*,*) "c must be in (0,1)"
    !     stop
    ! end if

    x = c
    N = 60
    sum = 0.d0

    do i = 1, N
        sum = sum + x / ( dble(i) * (dble(i) + x) )
    end do

    psi = -gamma_e - 1.d0/x + sum

end function digamma
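
As a side note on the function above, the following is a minimal sketch (not part of the original project) that exposes the number of series terms as an argument, so the truncation at N = 60 can be checked against the known closed form psi(1/2) = -gamma_e - 2*ln(2). The module name `digamma_check` and the functions `digamma_series` and `psi_half` are hypothetical names introduced only for this illustration.

```fortran
! Illustrative helper module; digamma_series and psi_half are hypothetical names.
module digamma_check
    implicit none
    real(8), parameter :: gamma_e = 0.5772156649015328606d0
contains

    ! Same truncated series as the posted digamma, with the term count exposed.
    function digamma_series(c, n_terms) result(psi)
        real(8), intent(in) :: c
        integer, intent(in) :: n_terms
        real(8) :: psi, partial
        integer :: i

        partial = 0.d0
        do i = 1, n_terms
            partial = partial + c / (dble(i) * (dble(i) + c))
        end do
        psi = -gamma_e - 1.d0/c + partial
    end function digamma_series

    ! Closed-form reference value: psi(1/2) = -gamma_e - 2*ln(2).
    function psi_half() result(psi)
        real(8) :: psi
        psi = -gamma_e - 2.d0 * log(2.d0)
    end function psi_half

end module digamma_check
```

Comparing `digamma_series(0.5d0, 60)` with `psi_half()` from a small driver program shows the gap left by stopping at 60 terms. The neglected tail is on the order of c/N, so it shrinks only linearly as more terms are added.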

MatColgrove · April 9, 2026, 9:03pm

What’s the overall time for compilation?

The “-time” output doesn’t look bad: 9 and 16 seconds seems reasonable. The extra time could be coming from the device code generator, which wouldn’t be included in this output, though I’m not sure.

Are you able to provide the full project? If it’s not something you can share on an open forum, please feel free to send me a direct message with details. To send a direct message, click on my account name and, on my profile, look for the “message” button in the upper-right.

Unfortunately, since I can’t compile the code snippets, I can’t investigate what the issue would be.

-Mat
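
For readers in a similar situation, one way to produce something compilable is to reduce the problem to a single module. The sketch below is only an assumption about what such a reduced case might look like: it keeps the posted digamma series and the general shape of the kernel (an outer parallel loop with a sequential inner loop that calls the routine), while `repro_mod`, `eval_kernel`, `c_in`, and `val_out` are placeholder names that do not appear in the original project.

```fortran
! Hypothetical stand-alone reproducer; every name except digamma is a placeholder.
module repro_mod
    implicit none
contains

    ! Truncated-series digamma, as posted above, marked for device use.
    function digamma(c) result(psi)
        !$acc routine seq
        real(8), intent(in) :: c
        real(8) :: psi
        real(8), parameter :: gamma_e = 0.5772156649015328606d0
        integer :: i
        real(8) :: partial

        partial = 0.d0
        do i = 1, 60
            partial = partial + c / (dble(i) * (dble(i) + c))
        end do
        psi = -gamma_e - 1.d0/c + partial
    end function digamma

    ! One kernels region that calls digamma from a sequential inner loop,
    ! mirroring the shape (not the content) of the posted kernel.
    subroutine eval_kernel(n, c_in, val_out)
        integer, intent(in)  :: n
        real(8), intent(in)  :: c_in(n, 4)
        real(8), intent(out) :: val_out(n)
        integer :: i, jj
        real(8) :: val_temp

        !$acc kernels copyin(c_in) copyout(val_out)
        !$acc loop independent private(val_temp)
        do i = 1, n
            val_temp = 0.d0
            !$acc loop seq
            do jj = 1, 4
                val_temp = val_temp + digamma(c_in(i, jj))
            end do
            val_out(i) = val_temp
        end do
        !$acc end kernels
    end subroutine eval_kernel

end module repro_mod
```

Compiling only this file with the same flags as the real project, plus “-time”, would show whether the slow phase survives in the reduced case. If it does not, the real expressions can be pasted back in one at a time until the compile time jumps.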
