hello ,I wondered why we should turn off the neon when Compile the code to consult the error ? Is it because the armv8 is 64-bit ,can’t run 32-bit simultaneously?
/usr/lib/gcc/aarch64-linux-gnu/5/include/arm_neon.h(1117): error: identifier "__builtin_aarch64_addhn2v8hi" is undefined
/usr/lib/gcc/aarch64-linux-gnu/5/include/arm_neon.h(1123): error: identifier "__builtin_aarch64_addhn2v4si" is undefined
/usr/lib/gcc/aarch64-linux-gnu/5/include/arm_neon.h(1129): error: identifier "__builtin_aarch64_addhn2v2di" is undefined
/usr/lib/gcc/aarch64-linux-gnu/5/include/arm_neon.h(1135): error: identifier "__builtin_aarch64_addhn2v8hi" is undefined
/usr/lib/gcc/aarch64-linux-gnu/5/include/arm_neon.h(1143): error: identifier "__builtin_aarch64_addhn2v4si" is undefined
/usr/lib/gcc/aarch64-linux-gnu/5/include/arm_neon.h(1151): error: identifier "__builtin_aarch64_addhn2v2di" is undefined
/usr/lib/gcc/aarch64-linux-gnu/5/include/arm_neon.h(1159): error: identifier "__builtin_aarch64_raddhn2v8hi" is undefined
/usr/lib/gcc/aarch64-linux-gnu/5/include/arm_neon.h(1165): error: identifier "__builtin_aarch64_raddhn2v4si" is undefined
Error limit reached.
100 errors detected in the compilation of "/tmp/tmpxft_00007538_00000000-6_elas_gpu.cpp1.ii".
Compilation terminated.
CMake Error at cuda_compile_generated_elas_gpu.cu.o.cmake:266 (message):
Error generating file
/home/data/libelas-gpu/build/CMakeFiles/cuda_compile.dir/GPU/./cuda_compile_generated_elas_gpu.cu.o
CMakeFiles/libelas_gpu.dir/build.make:70: recipe for target 'CMakeFiles/cuda_compile.dir/GPU/cuda_compile_generated_elas_gpu.cu.o' failed
make[2]: *** [CMakeFiles/cuda_compile.dir/GPU/cuda_compile_generated_elas_gpu.cu.o] Error 1
CMakeFiles/Makefile2:67: recipe for target 'CMakeFiles/libelas_gpu.dir/all' failed
make[1]: *** [CMakeFiles/libelas_gpu.dir/all] Error 2
Makefile:83: recipe for target 'all' failed
make: *** [all] Error 2
but I can’t close the neon ,because the project has sse instruction.And I convert the sse to neon.I wonder if I should change neon to aarch64 to solve the problem .But it doesn’t work.
Thank you for your help !
In the mean time, I would like to ask your help to see if we can have a simplified version of code to reproduce?
As you know, we may not able to help debug code from 3rd party. It would be easier if you can just write a very simple example with NEON enabled and confirmed you hit similar issue.
I’m sorry not to have the simplified version of code.t.And the sse_to_neon.hpp I can’t commit to github.
you can find it here https://github.com/otim/SSE-to-NEON
Thank you for your help.