Cuda-gdb freezes when binary compiled with -stdpar -std=c++20 flag

jakob.niessner · November 17, 2025, 11:07am

I have compiled the following code

#include <iostream>
#include <algorithm>
#include <execution>
#include <numeric>

int main(){
int number_vals = 130’000’000;
double* Data = new double[number_vals]{};
// std::fill(std::execution::par_unseq, Data, Data+number_vals, 1.);

double result = std::reduce(std::execution::par_unseq, Data, Data+number_vals, 0.0, std::plus<>{});
std::cout << “Hello World!”<<std::endl;
std::cout << "The result is: "<< result<<std::endl;

}

using the command:

nvc++ -std=c++20 -stdpar main.cc -g.

but when I start the resulting binary in cuda-gdb the application freezes at startup. The problem does not seem to occur when I do not use the flag -std=c++20. I am working with the latest version of the HPC SDK on Ubuntu 24.04 using an rtx-4090

nvidia-smi output:

±----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| 0 N/A N/A 2032 G /usr/lib/xorg/Xorg 9MiB |
| 0 N/A N/A 2186 G /usr/bin/gnome-shell 10MiB |
±----------------------------------------------------------------------------------------+

MatColgrove · November 17, 2025, 11:34pm

Hi jakob.niessner and thanks for the report!

I was able to recreate the issue here. Looks to be getting stuck when trying to acquire a lock. My guess is that there’s some change in the header files, but have asked engineering to investigate. For reference, I reported this issue as TPR #37993.

-Mat

MatColgrove · January 28, 2026, 10:55pm

Hi jakob.niessner

FYI, TPR #37993 has been fixed in our 26.1 release.

-Mat

Topic		Replies	Views
Nvc++ compiler internal error with C++20 standard parallelism nvc, nvc++ and nvfortran	1	870	June 30, 2022
Issues with stdpar using nvc++ on Grace-Hopper nvc, nvc++ and nvfortran	2	38	March 23, 2026
Stdpar -- Floating point exception nvc, nvc++ and nvfortran	2	889	March 7, 2021
Std::transform_reduce incompatible with nvc++ -stdpar=gpu nvc, nvc++ and nvfortran algorithm	1	605	December 1, 2022
Can't run simple std::par program nvc, nvc++ and nvfortran	3	327	May 27, 2024
Cannot open STL source file "concepts" when compiling stdexec example code using nvc++ nvc, nvc++ and nvfortran	6	795	January 2, 2024
Nvc++ stdpar compilation and linking problems nvc, nvc++ and nvfortran	2	787	October 12, 2021
Compiling a Catch2 application with nvcc -std=c++20 leads a crash in cudafe++ CUDA NVCC Compiler	4	901	August 1, 2023
LLVM Error when compiling C++ STD parallel execution policies to GPU nvc, nvc++ and nvfortran	9	726	May 2, 2024
Nvc++ & external CUDA-thrust conflicts for -stdpar offload nvc, nvc++ and nvfortran	5	556	December 12, 2022

Cuda-gdb freezes when binary compiled with -stdpar -std=c++20 flag

Related topics