Default Stream per Thread - Driver API

tamirg · August 7, 2019, 10:29am

Hi,
I was looking into using –default-stream per-thread or equivalent #define CUDA_API_PER_THREAD_DEFAULT_STREAM for driver API calls (specifically FFMpeg cuvid decoder).
But couldn’t find any documentation about it.

The program loads the DLL dynamically - there is no call to nvcc in which I can add --default-stream per-thread and adding CUDA_API_PER_THREAD_DEFAULT_STREAM will not affect the loaded DLL functions.

Looking into cuda.h I’ve seen the following macro being used when CUDA_API_PER_THREAD_DEFAULT_STREAM is defined:

#if defined(__CUDA_API_VERSION_INTERNAL) || defined(CUDA_API_PER_THREAD_DEFAULT_STREAM)
    #define __CUDA_API_PER_THREAD_DEFAULT_STREAM
    #define __CUDA_API_PTDS(api) api ## _ptds
    #define __CUDA_API_PTSZ(api) api ## _ptsz

And used in some of the APIs as follows:

#define cuMemcpyHtoD                        __CUDA_API_PTDS(cuMemcpyHtoD_v2)
...
#define cuMemcpy2D                          __CUDA_API_PTDS(cuMemcpy2D_v2)
...
#define cuStreamSynchronize                 __CUDA_API_PTSZ(cuStreamSynchronize)

Does it mean that by dynamic loading the ptds / ptsz versions of the APIs used in FFMpeg I would be able to achieve “default stream per thread” behaviour?

Robert_Crovella · August 7, 2019, 1:36pm

As indicated here:

One possible approach is to explicitly access the per-thread default stream.

In the CUDA runtime API, that stream has a particular handle:

cudaStreamPerThread

In the CUDA driver API, that handle is:

CU_STREAM_PER_THREAD

[url]CUDA Driver API :: CUDA Toolkit Documentation

tamirg · August 11, 2019, 8:13am

Thanks!
I don’t know how I missed it :)

Topic		Replies	Views
what's the difference between suffix _ptds and _ptsz ? CUDA Programming and Performance	3	2220	July 8, 2022
"--default-stream per-thread" on multi-GPU environment not working as expected? CUDA Programming and Performance	1	230	September 19, 2023
--default-stream per-thread question CUDA Programming and Performance	2	750	August 22, 2018
CUDA per-thread and cudnn behaviour CUDA Programming and Performance	1	1280	September 15, 2017
CUDA Fortran Equivalent to nvcc --default-stream per-thread Legacy PGI Compilers	1	962	September 19, 2019
Per-thread Default Stream Concurrency CUDA Programming and Performance	2	2051	February 10, 2018
How does "cudaStreamPerThread" variable behave without "--default-stream per-thread" compilation option? CUDA Programming and Performance	1	337	November 17, 2023
How to set --default-stream per-thread in nsight Nsight Eclipse Edition cuda	4	912	July 10, 2020
Adding CUDA streams to threaded software CUDA Programming and Performance	7	8568	August 2, 2011
Cuda nvcc default stream per-thread doesn't seem to be working CUDA Programming and Performance	0	701	August 10, 2020

Default Stream per Thread - Driver API

Related topics