Error with static inline in nvc

m96 · March 9, 2023, 9:18pm

Hi everyone,
I’m trying to compile an external library to my program with nvc/nvc++ but I get these errors:

I hope someone can help me

MatColgrove · March 9, 2023, 10:39pm

Likely due to this section in starting at line 1670 of “PacketMath.h”. We added support for the m128 intrinsics a bit ago so this redefinition is no longer needed.

However, it appears that the code does check the compiler version, so I’m not sure why these would still be included. Possibly due to our rebranding from PGI to NVHPC? Not sure.

You might want to ask the Eigen folks since they’ll have a better understand.

#if EIGEN_COMP_PGI && EIGEN_COMP_PGI < 1900
// PGI++ does not define the following intrinsics in C++ mode.
static inline __m128  _mm_castpd_ps   (__m128d x) { return reinterpret_cast<__m128&>(x);  }
static inline __m128i _mm_castpd_si128(__m128d x) { return reinterpret_cast<__m128i&>(x); }
static inline __m128d _mm_castps_pd   (__m128  x) { return reinterpret_cast<__m128d&>(x); }
static inline __m128i _mm_castps_si128(__m128  x) { return reinterpret_cast<__m128i&>(x); }
static inline __m128  _mm_castsi128_ps(__m128i x) { return reinterpret_cast<__m128&>(x);  }
static inline __m128d _mm_castsi128_pd(__m128i x) { return reinterpret_cast<__m128d&>(x); }
#endif

-Mat

m96 · March 10, 2023, 6:57pm

Hi Mat,
thanks for the reply I solved the problem by following your old post:
https://forums.developer.nvidia.com/t/error-compiling-with-eigen-library/136246

The program works now but I have one more question, in the early stages I’m using unified memory to avoid using explicit data copying. When I launch the program, it works but I have this statement:

memalign: call to cuMemAllocManaged returned error 1: Invalid value

Is there a way to figure out which value is creating the error? Because then if I try to add other openacc directives in the code I get segmentation errors and I would like to understand if it is that value that creates these problems for me.

MatColgrove · March 10, 2023, 8:23pm

thanks for the reply I solved the problem by following your old post:

Ah, yes. I knew I’d seen this before. It looks like they updated the code to at least attempt to only use this with older compiler versions, but I’m not sure how they are determining the “EIGEN_COMP_PGI” value.

Is there a way to figure out which value is creating the error?

You’ll likely need to run the code through a debugger, like gdb, and see if it interrupts on the error or if you can put in breaks to track it down. This wont show which variable is getting allocated, but might show the value being used.

You can try the ‘compute-sanitizer’ utility as well, but since this is coming from the host side, doubt it will be able to help.

When using the “-gpu=managed” flag, the compile will replace visible allocation calls (i.e. malloc, new, etc.) with calls to cudaMallocManaged. I don’t see any direct calls to memalign, so assume it gets called through one of the aligned malloc calls.

m96 · March 11, 2023, 3:23pm

Hi Matt,
I tried to use nvidia’s compute-sanitizer tool and

compute-sanitizer --log-file memory2 SU2_CFD inv_NACA0012.cfg

I got this as a result
memory2 (733.2 KB)

if i understand correctly, from what i found online, the Host_Frame report the path to the error?

MatColgrove · March 13, 2023, 4:53pm

I believe so. However when I grabbed “allocation_toolbox.hpp” from SU2 and put it in an example program, I was unable to reproduce the error.

I thought it might be a problem with how we’re replacing the aligned_alloc with aligned_alloc_managed, but it it doesn’t seems so, at least for generic use. Though there might be something specific which SU2 which is causing it.

I can try building and running SU2, but I’m a bit swamped right now, so may not be able to get to it any time soon. Hence, if you can do more analysis to determine why this is occurring, that would be appreciated.

-Mat

Topic		Replies	Views
NVC++-F-0000-Internal compiler error. msz_dtype, bad value 775 nvc, nvc++ and nvfortran	3	1178	June 26, 2020
Pgc++ can't running c++ code with eigen library nvc, nvc++ and nvfortran open-source-software	2	1135	December 4, 2021
Breaking change for Eigen code in NVCC 11.3 and later vs NVCC 11.2 and earlier CUDA NVCC Compiler	3	2134	August 17, 2021
Unable to link project when using nvc++ and nvc nvc, nvc++ and nvfortran hpc	13	1814	March 2, 2023
Possible bug in nvc++ 23.05 nvc, nvc++ and nvfortran	1	479	July 20, 2023
Undefined reference to `__builtin_ia32_palignr256' when calling _mm256_alignr_epi8 nvc, nvc++ and nvfortran	3	666	January 30, 2024
NVC++-F-0000-Internal compiler error. must have operand nvc, nvc++ and nvfortran nvbugs	9	983	November 18, 2024
LLVM Error when compiling C++ STD parallel execution policies to GPU nvc, nvc++ and nvfortran	9	659	May 2, 2024
Nvc++: undefined __kmpc_for_static_init_16 and Unexpected branch type nvc, nvc++ and nvfortran	7	465	April 2, 2024
CudaIllelgalMemoryaccess only in nvc++ with -O0 optimization level and -stdpar nvc, nvc++ and nvfortran	3	43	November 18, 2025

Error with static inline in nvc

Related topics