Multi-threaded issue with --default-stream per-thread

Hi,
I’m using the --default-stream per-thread compilation flag with multiple host threads, and calling cudaStreamSynchronize(cudaStreamPerThread) in each thread to synchronize that thread’s stream.
The code hangs. Under gdb, all of the threads appear to be blocked inside cudaStreamSynchronize.
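
For context, each worker thread does roughly the following (a simplified sketch; the real kernel and launch configuration are omitted):

#include <cuda_runtime.h>

__global__ void work() { /* ... real work ... */ }

// Per-thread pattern (built with --default-stream per-thread):
void* worker(void*)
{
  work<<<1, 1>>>();                            // lands in this thread's default stream
  cudaStreamSynchronize(cudaStreamPerThread);  // wait only on this thread's stream
  return NULL;
}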

Does anyone have an idea? Is this a bug in this configuration?

thanks
Eyal

No one uses this feature??

In my experience you’re more likely to get help if you provide a short, complete test case demonstrating the issue.

I created a simple test case and it seems to work fine for me.

$ cat t325.cu
#include <iostream>
#include <pthread.h>

const size_t dt = 1000000000ULL;  // device clock ticks each kernel busy-waits
const size_t nt = 4;              // number of host threads

// Kernel that spins for roughly dt device clock ticks
__global__ void k(){
  size_t start = clock64();
  while (clock64() < start+dt);
}

// Placeholder for per-thread arguments (unused here)
typedef struct {
} ptArgs;

// Thread routine: launch a kernel into this thread's default stream,
// then synchronize only that stream
static void* rt(void* args)
{
  k<<<1,1>>>();
  cudaStreamSynchronize(cudaStreamPerThread);
  std::cout << "thread exiting" << std::endl;
  return NULL;
}

int main(int argc, char* argv[])
{
  pthread_t pt[nt];
  ptArgs args[nt];
  for (size_t t = 0; t < nt; ++t) {
    pthread_create(pt + t, NULL, &rt, (void*)(args + t));
  }
  std::cout << "threads created" << std::endl;
  for (size_t t = 0; t < nt; ++t) {
    pthread_join(pt[t], NULL);
  }
  return 0;
}
$ nvcc -o t325 t325.cu -lpthread --default-stream per-thread
$ cuda-memcheck ./t325
========= CUDA-MEMCHECK
threads created
thread exiting
thread exiting
thread exiting
thread exiting
========= ERROR SUMMARY: 0 errors
$

CUDA 10.0, CentOS 7, Tesla P100

Hi Robert,
Thanks for the answer. Indeed, I was unable to reproduce the issue with the code you sent.
However, I think I’ve found the root cause. Inside the thread function I was calling the nvtx* functions that NVIDIA provides, using them as described in the documentation.

It seems the issue was that the nvtxEventAttributes_t parameter, passed by reference to nvtxRangePushEx, went out of scope too soon.
I have not yet been able to reproduce the hang with the code you sent; I’ll try later this week. However, creating the nvtxEventAttributes_t variable passed to nvtxRangePushEx on the stack, and making sure it does not go out of scope until the matching pop operation and the cudaStreamSynchronize(cudaStreamPerThread) call have completed, cleared the deadlock.
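
For reference, the corrected pattern looks roughly like this (a sketch assuming the standard NVTX v2 attributes API; the range name and kernel are illustrative):

#include <nvToolsExt.h>    // link with -lnvToolsExt
#include <cuda_runtime.h>

__global__ void k();       // e.g. the kernel from the test case above

void* worker(void*)
{
  // The attributes struct lives on this thread's stack and stays in
  // scope until after nvtxRangePop and the stream synchronize complete.
  nvtxEventAttributes_t attrib = {0};
  attrib.version = NVTX_VERSION;
  attrib.size = NVTX_EVENT_ATTRIB_STRUCT_SIZE;
  attrib.messageType = NVTX_MESSAGE_TYPE_ASCII;
  attrib.message.ascii = "worker range";  // illustrative name

  nvtxRangePushEx(&attrib);               // takes a pointer to the attributes
  k<<<1,1>>>();
  cudaStreamSynchronize(cudaStreamPerThread);
  nvtxRangePop();
  return NULL;
}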

So I guess I’m now wondering about two things:

  • Could corruption of the nvtxEventAttributes_t parameter cause cudaStreamSynchronize to hang? Why?
  • Why do all the profiling functions take their parameters (for example, the nvtxEventAttributes_t passed to nvtxRangePushEx) by reference rather than by value?

thanks
Eyal