CUDA 12.6U1 NVCC Regression

frenzi · September 10, 2024, 3:19am

After an update from CUDA Toolkit 12.6 to 12.6U1, I am unable to compile previously working code. I get the following new error:

[build] nvcc warning : incompatible redefinition for option 'optimize', the last value of this option was used
[build] /usr/lib/gcc/x86_64-linux-gnu/11/include/amxtileintrin.h(42): error: identifier "__builtin_ia32_ldtilecfg" is undefined
[build]     __builtin_ia32_ldtilecfg (__config);
[build]     ^
[build] 
[build] /usr/lib/gcc/x86_64-linux-gnu/11/include/amxtileintrin.h(49): error: identifier "__builtin_ia32_sttilecfg" is undefined
[build]     __builtin_ia32_sttilecfg (__config);
[build]     ^
[build] 
[build] 2 errors detected in the compilation of "/home/bryce/dev/torch-discounted-cumsum-nd/torch_discounted_cumsum_nd/operator.cu"

OS = Ubuntu 24.04
NVCC = cuda_12.6.r12.6/compiler.34714021_0
GCC = 13.2 and 11.4

The source code can be found here. Requires pytorch >=2.4 as noted in the readme. I’m just on a Ryzen 5950X, I don’t even have AMX extensions.

Topic		Replies	Views
Compilation Errors with GCC Versions 11-14 and CUDA Toolkit 12.5/12.6 Due to Undefined `__builtin_ia32_ldtilecfg` and `__builtin_ia32_sttilecfg`, etc GPU-Accelerated Libraries cuda	2	1367	October 15, 2024
__builtin_ia32_ldtilecfg and __builtin_ia32_sttilecfg are undefined CUDA NVCC Compiler	6	1243	July 12, 2024
NVCC (v12.6) fails to compile Qiskit-AER with error: identifier "__builtin_ia32_ldtilecfg" is undefined CUDA NVCC Compiler	0	91	September 18, 2024
CUDA compile error with nvhpc 22.11 nvc, nvc++ and nvfortran	2	587	January 9, 2024
Nvc++ with cuda 12.0 nvc, nvc++ and nvfortran	6	1710	January 13, 2023
FC11 x86_64: __sync_fetch_and_add error... compiler error with nvcc... CUDA Programming and Performance	3	5322	November 30, 2009
CUDA 12.4 nvcc and GCC 14.1 incompatibility? CUDA NVCC Compiler linux	2	2625	May 17, 2024
ICC (intel compiler) compatibilities CUDA Programming and Performance	7	8765	November 11, 2011
Compile Issues Jetson TK1	5	11417	February 2, 2016
nvcc with avx support cannot find gcc builtin intrinsics CUDA Programming and Performance	5	3433	October 12, 2014

CUDA 12.6U1 NVCC Regression

Related topics