Compiling a Catch2 application with nvcc -std=c++20 leads a crash in cudafe++

fwyzard · May 31, 2023, 3:05pm

Compiling the main() part of a Catch2 application with nvcc in c++20 mode leads to memory exhaustion inside cudafe++, likely due to an infinite loop.

A reproducer can be as simple as test.cu:

#define CATCH_CONFIG_MAIN
#include <catch2/catch.hpp>

Compiling with nvcc in c++17 mode works fine:

/usr/local/cuda-12.1/bin/nvcc -std=c++17 test.cu -c -o test.o

Compiling with nvcc in c++20 seems to hang, and is eventually killed:

/usr/local/cuda-12.1/bin/nvcc -std=c++20 test.cu -c -o test.o
Killed

Investigatinh with nvcc -v -keep shows that the problem is in the cudafe++ step:

/usr/local/cuda-12.1/bin/nvcc -std=c++20 test.cu -c -o test.o -v -keep -keep-dir tmp
...
gcc -std=c++20 -D__CUDA_ARCH_LIST__=520 -E -x c++ -D__CUDACC__ -D__NVCC__  "-I/usr/local/cuda-12.1/bin/../targets/x86_64-linux/include"    -D__CUDACC_VER_MAJOR__=12 -D__CUDACC_VER_MINOR__=1 -D__CUDACC_VER_BUILD__=105 -D__CUDA_API_VER_MAJOR__=12 -D__CUDA_API_VER_MINOR__=1 -D__NVCC_DIAG_PRAGMA_SUPPORT__=1 -include "cuda_runtime.h" -m64 "test.cu" -o "tmp/test.cpp4.ii"
cudafe++ --c++20 --gnu_version=110300 --display_error_number --orig_src_file_name "test.cu" --orig_src_path_name "/home/fwyzard/src/nvidia_bug_nnnnnnnn/test.cu" --allow_managed  --m64 --parse_templates --gen_c_file_name "tmp/test.cudafe1.cpp" --stub_file_name "test.cudafe1.stub.c" --gen_module_id_file --module_id_file_name "tmp/test.module_id" "tmp/test.cpp4.ii"
Killed
# --error 0x89 --

The last line of tmp/test.cudafe1.cpp is over 300 MB of repeating std::remove_cv_t< const std::remove_cv_t< const std::remove_cv_t< const ..., which points to some kind of infinite loop inside cudafe++.

fwyzard · May 31, 2023, 3:16pm

Submitted as NVIDIA bug #4139863.
For a trivial reproducer, see GitHub - fwyzard/nvidia_bug_4139863: Simple reproducer for NVIDIA bug #4139863 .

andreas.henne · August 1, 2023, 1:25pm

We have exactly the same problem. We are using catch and with C++17 it works. With C++20, cudafe++ seems to be stuck in an endless loop. Did you find any workaround by any chance or is there anything new related to this problem? Unfortunately, this prevents us from switching to C++20 right now. :-(

fwyzard · August 1, 2023, 1:40pm

Unfortunately this is still a problem with CUDA 12.2.1 .

fwyzard · August 1, 2023, 1:42pm

The workaround we are using is to move the “main” part defined by

#define CATCH_CONFIG_MAIN
#include <catch2/catch.hpp>

into a .cc file, and implement only the tests in the .cu files:

#include <catch2/catch.hpp>

...

Then, the main part and the tests can be linked together with g++.

In this way nvcc and cudafe++ never see the “main” part.

Topic		Replies	Views
CUDACOMPILE : nvcc error : 'cudafe++' died with status 0xC0000409 CUDA NVCC Compiler	16	5996	June 3, 2024
Adding thrust headers & Cuda 13 update 2 Visual Studio 2022 Version 17.14.21 throws CUDACOMPILE : nvcc error : 'cudafe++' died with status 0xC0000409 CUDA NVCC Compiler cuda	0	68	November 21, 2025
Error 'cudafe++' died with status 0xC0000409 CUDA Programming and Performance cuda	3	951	September 25, 2024
Windows command line NVCC compilation error 'cudafe++' died with status 0xC0000005 (ACCESS_VIOLATION) CUDA Programming and Performance cuda , nvcc	1	1561	January 26, 2024
Cuda-gdb freezes when binary compiled with -stdpar -std=c++20 flag nvc, nvc++ and nvfortran kernel	2	33	January 28, 2026
C++20 user defined literals in nvcc 12.0 bug CUDA NVCC Compiler cuda , nvcc	7	1963	October 19, 2023
permanent CUDAFE crashes due to 0x0 memory reference CUDA Programming and Performance	6	1694	January 19, 2015
"cudafe.exe has stopped working" - vs2008 CUDA Programming and Performance	3	16387	April 4, 2009
Cudafe++ runs out of memory and crashes during compilation CUDA NVCC Compiler	1	620	September 30, 2023
Error compiling cuFFTDx code: ‘cudafe++’ died with status 0xC0000409 GPU-Accelerated Libraries cufft , cutlass	2	166	October 8, 2024

Compiling a Catch2 application with nvcc -std=c++20 leads a crash in cudafe++

Related topics