Understanding "No device symbol for address reference" message

patrick.begou · April 29, 2022, 3:01pm

I am stuck on a “No device symbol for address reference” message from nvfortran in nvhpc/22.3. I am trying to offload a small piece of code, letting the compiler apply its defaults as a first step:

do n=1,grid%nel_grps
     el_grp => grid%el_grps(n)%ptr
     pair2node1_val => el_grp%pair2node1%val
     pair2node2_val => el_grp%pair2node2%val
     r1_ptr_val => data_ptr%r1_ptrs(n)%ptr%val
     sym_op_val => sym_op_ptr%r1_ptrs(n)%ptr%val
     prod_r1_val => product_ptr%r1_ptrs(n)%ptr%val
     prod_r1_val(1:el_grp%nnode) = 0.0_WP
     niter = el_grp%npair

!$omp target
!$omp parallel do
     do ip=1,niter
          ino1 = pair2node1_val(ip)
          ino2 = pair2node2_val(ip)
          coeff = sym_op_val(ip)*(r1_ptr_val(ino2)-r1_ptr_val(ino1))
!$omp atomic
          prod_r1_val(ino1) = prod_r1_val(ino1) + coeff
!$omp atomic
          prod_r1_val(ino2) = prod_r1_val(ino2) - coeff
     end do
!$omp end target
     ...... 
end do

All val components are one-dimensional double precision or integer allocatable arrays in user-defined types.
Compilation with -Minfo shows data movements (with tofrom and correct shape).

Generating implicit map(tofrom:pair2node1_val(:),sym_op_val(:),r1_ptr_val(:),prod_r1_val(:),pair2node2_val(:))

But compilation aborts with:

NVFORTRAN-W-0155-Compiler failed to translate accelerator region (see -Minfo messages): No device symbol for address reference

And I do not know where to start tracking this problem down. The compile command is:

/opt/nvidia/hpc_sdk/Linux_x86_64/22.3/comm_libs/mpi/bin/mpifort -c -O1 -mp=gpu -gpu=cc80 -target=gpu -Minfo=accel  ....

Thanks for any suggestion.

Patrick

MatColgrove · April 29, 2022, 5:19pm

Hi Patrick,

The error means that the compiler can’t find a device symbol for one or more of the pointers, though I don’t know exactly what’s causing it. Can you please provide a reproducing example so I can investigate?

Thanks,
Mat

patrick.begou · April 29, 2022, 7:56pm

Hi Mat,
I will try to create a simplified user-defined type involving only the components used there.
Is there a way to know which device symbol is not found?
Regards
Patrick

patrick.begou · May 4, 2022, 9:29am

Hi Mat,
I’ve worked on this problem, simplifying the code more and more until it does nothing interesting but still shows the No device symbol message. The short test case is attached:
defs_m.f90 (977 Bytes)
linear_solver_mat_op_m.f90 (1.6 KB)
Makefile (684 Bytes)
It doesn’t build an executable (no main program is provided).
A minimal offloaded kernel is implemented in a module in linear_solver_mat_op_m.f90. This module uses definitions from another module implemented in defs_m.f90 (they are needed in the real code, but not here).
If I remove line 20 from the defs_m.f90 module file:

     20       !$OMP THREADPRIVATE(nsolver,current_solver,debug_level,dummy_int,dummy_real)

compilation is successful. If the line is present I get the No device symbol error, even though the variables in the threadprivate directive are not used in this test case (in the real code, which mixes MPI and OpenMP, I need them).
Any idea about this?
Patrick

MatColgrove · May 4, 2022, 5:46pm

Thanks Patrick, this is helpful and I can reproduce the error.

I suspect that when the compiler outlines the target region (outlining basically creates a function that’s then passed to the runtime), it also brings over the module variables. Because of the outlining, it doesn’t know whether they are used or not. But since these are threadprivate, the actual reference in the module is different from the one used at runtime.
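
For readers without the attachments, the layout described in this thread can be sketched roughly as follows. This is a hypothetical reconstruction, not the attached files: only the module names and the THREADPRIVATE variable list come from the thread, while the variable types, the WP kind, and the subroutine name fill_coeff are assumptions. Whether this exact sketch triggers the diagnostic with a given compiler release has not been verified.

```fortran
! Hypothetical sketch of the two-module layout; not the attached test case.
module defs_m
   implicit none
   integer, parameter :: WP = kind(1.0d0)   ! assumed working precision
   ! Types below are assumptions; only the names come from the thread.
   integer  :: nsolver = 0, current_solver = 0, debug_level = 0
   integer  :: dummy_int = 0
   real(WP) :: dummy_real = 0.0_WP
   ! Removing this directive reportedly lets the compilation succeed.
   !$OMP THREADPRIVATE(nsolver,current_solver,debug_level,dummy_int,dummy_real)
end module defs_m

module linear_solver_mat_op_m
   use defs_m          ! brings the threadprivate variables into scope
   implicit none
contains
   subroutine fill_coeff(coeff, niter)   ! hypothetical kernel name
      integer,  intent(in)  :: niter
      real(WP), intent(out) :: coeff(niter)
      integer :: ip
      ! Target region that the compiler outlines into a separate function.
      ! None of the threadprivate variables is referenced inside it.
      !$omp target map(tofrom:coeff)
      !$omp parallel do
      do ip = 1, niter
         coeff(ip) = ip
      end do
      !$omp end parallel do
      !$omp end target
   end subroutine fill_coeff
end module linear_solver_mat_op_m
```

Like the attached test case, this contains no main program, so it is meant to be compiled with -c only.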

The workaround is to use the “loop” construct, which doesn’t outline:

!$omp target teams map(tofrom:coeff)
!$omp loop
     do ip=1,niter
          coeff(ip) = ip
     end do
!$omp end target teams
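
Placed in a complete compilation unit, the workaround above might look like the sketch below. The module kernel_m, the subroutine fill_coeff, the driver program, and the variable types are all hypothetical and only serve to make the fragment self-contained; the directives are the ones shown above.

```fortran
! Hypothetical, self-contained sketch of the "loop" workaround.
module defs_m
   implicit none
   integer, parameter :: WP = kind(1.0d0)   ! assumed working precision
   ! Types are assumptions; the names come from the thread.
   integer  :: nsolver = 0, current_solver = 0, debug_level = 0
   integer  :: dummy_int = 0
   real(WP) :: dummy_real = 0.0_WP
   !$OMP THREADPRIVATE(nsolver,current_solver,debug_level,dummy_int,dummy_real)
end module defs_m

module kernel_m
   use defs_m
   implicit none
contains
   subroutine fill_coeff(coeff, niter)
      integer,  intent(in)  :: niter
      real(WP), intent(out) :: coeff(niter)
      integer :: ip
      ! "target teams" + "loop" replaces "target" + "parallel do".
      !$omp target teams map(tofrom:coeff)
      !$omp loop
      do ip = 1, niter
         coeff(ip) = ip
      end do
      !$omp end target teams
   end subroutine fill_coeff
end module kernel_m

program check_fill_coeff
   use kernel_m
   implicit none
   integer, parameter :: n = 1000
   real(WP) :: coeff(n)
   integer :: i
   call fill_coeff(coeff, n)
   ! Each element should hold its own index.
   do i = 1, n
      if (coeff(i) /= real(i, WP)) error stop "unexpected value in coeff"
   end do
   print *, "coeff filled as expected"
end program check_fill_coeff
```

Built without OpenMP flags the directives are treated as comments and the loop runs serially, which gives a quick way to check the logic before trying the offload build with the options from the first post (-mp=gpu and so on).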

I filed TPR #31776 and sent it to engineering for review.

-Mat
