error: identifier "__hadd2" is undefined

Joseph_A · March 24, 2020, 1:39am

Hi,

I’m trying to compile the following program on our DGX2 machine using the PGI compiler 19.10.
We want to use the half2 vector datatype, but it does not compile:

main.cu(16): error: identifier "__hadd2" is undefined

Is something wrong in our cluster/makefile setup?

Thank you for your help

main.cpp

#include <iostream>
#include <cuda_fp16.h>
using namespace std;

__global__
void halfTest(int N, const half *x, half *y)
{
   int start = threadIdx.x + blockDim.x * blockIdx.x;
   int stride= blockDim.x * gridDim.x;
   int n2 = N/2;
   half2 *x2 = (half2*)x;
   half2 *y2 = (half2*)y;

   for(int i=start;i<n2;i+=stride)
     y2[i] = __hadd2(x2[i], y2[i]);

}

int main()
{
    cout << "Programstart.\n";
    const int N = 1e6;
    half *x = new half[N];
    half *y = new half[N];
    for(int i=0;i<N;i++)
    {
      x[i] = 1.0;
      y[i] = 0.0;
    }

    #pragma acc data copyin(x[:N],y[:N]) copyout(x[:N],y[:N])
    {
     #pragma host_data use_device(x,y)
     {
        halfTest<<<1,1>>>(N,x,y);
     }
    }


    return 0;
}

makefile:

ARCH = -ta=tesla:cc70
CC   = mpicc
CCU  = nvcc -ccbin=mpic++

RM   = /bin/rm
PROG = run

OBJS = main.o
OPTS =  ${ARCH} -acc -Minfo=accel -Minfo -Mcuda

%.o : %.c
        ${CC}  ${OPTS} -c ${CFLAGS} $<
%.o : %.cu
        ${CCU} -Xcompiler "${OPTS}" -c ${CUFLAGS} $<

all : ${PROG}
${PROG} : ${OBJS}
        mpic++ ${OPTS} -o $@ ${OBJS} ${LDFLAGS} ${CFPMODEL} ${libs}

clean :
        ${RM} -f ${PROG} *.o *~

MatColgrove · March 24, 2020, 2:57pm

Hi Peter,

I believe nvcc defaults to targeting older devices that don’t support half precision. Try setting the gpu architecture to CC70.

% nvcc main.cu
main.cu(15): error: identifier "__hadd2" is undefined

1 error detected in the compilation of "/tmp/tmpxft_000020b0_00000000-8_main.cpp1.ii".
% nvcc main.cu --gpu-architecture=compute_70
%

Hope this helps,
Mat

Joseph_A · March 26, 2020, 1:40am

mkcolg:

Hi Peter,

I believe nvcc defaults to targeting older devices that don’t support half precision. Try setting the gpu architecture to CC70.
% nvcc main.cu
main.cu(15): error: identifier "__hadd2" is undefined

1 error detected in the compilation of "/tmp/tmpxft_000020b0_00000000-8_main.cpp1.ii".
% nvcc main.cu --gpu-architecture=compute_70
%
Hope this helps,
Mat

Thank you for your answer. It worked.

Topic		Replies	Views
Error: identifier "__hdiv" is undefined when include cuda_fp16.h in .cu CUDA NVCC Compiler	4	1355	August 5, 2024
error using half2 arithmetic on TX2 Jetson TX2	1	646	March 1, 2018
__hadd not working correctly CUDA Programming and Performance cuda	3	412	October 19, 2023
Nvcc(cuda 11.6) compiled failed: __hmax undefined CUDA NVCC Compiler	15	1262	August 18, 2023
Identifier "__HALF2_TO_UI" is undefined when using asm for cuda CUDA Programming and Performance cuda	18	707	October 27, 2023
How to cuda half and half functions CUDA Programming and Performance	5	4155	January 10, 2019
Error: unknown type name ‘half’ CUDA Programming and Performance compile	5	1905	April 23, 2020
Problem manipulating half precision variables in CUDA kernel Jetson TX2	4	1565	October 18, 2021
cuda error for make. error fix GPU-Accelerated Libraries	0	789	June 7, 2019
OpenACC-CUDA interoperability within the same file Legacy PGI Compilers	4	4231	November 4, 2016

error: identifier "__hadd2" is undefined

Related topics