--default-stream per-thread question

eyalhir74 · August 20, 2018, 6:41am

Hi,
Anyone has experience with the --default-stream per-thread flag in a production environment?
Would it work in a production environment, under stress with many threads (and therefore streams) openning and closing all the time, 24x7?

Also, what would happen with a cudaMemcpy when running under this configuration? would a non-pinned memcpy running on a stream (created due to the --default-stream per-thread flag), would synchronize everything or just the currently created one?

thanks
Eyal

eyalhir74 · August 22, 2018, 6:58am

Any idea??

thanks
Eyal

Robert_Crovella · August 22, 2018, 2:09pm

for non-pinned cudaMemcpy behavior, see here:

[url]https://devtalk.nvidia.com/default/topic/1038581/cuda-programming-and-performance/performances-of-multi-thread-vs-multi-process-with-mps/post/5276929/#5276929[/url]

it may affect other activity on the device, besides just the activity on the stream its on

Topic		Replies	Views
Multi threaded issue with --default-stream per-thread CUDA Programming and Performance	3	1024	November 20, 2018
Kernels launched by multiple host threads get serialized by cudaStreamSynchronize(0) when --default- CUDA Programming and Performance	7	3072	October 12, 2021
GPU Pro Tip: CUDA 7 Streams Simplify Concurrency Technical Blog	51	2919	February 5, 2020
"--default-stream per-thread" on multi-GPU environment not working as expected? CUDA Programming and Performance	1	284	September 19, 2023
Per-thread Default Stream Concurrency CUDA Programming and Performance	2	2270	February 10, 2018
Why my kernel code looses synchronization when running it in stream different from default ? CUDA Programming and Performance	9	991	November 14, 2016
Cuda nvcc default stream per-thread doesn't seem to be working CUDA Programming and Performance	0	788	August 10, 2020
Concurrency about default stream CUDA Programming and Performance	3	2853	March 23, 2015
Is cudaMemcpyAsync + cudaStreamSynchronize on default stream equal to cudaMemcpy (non-async) CUDA Programming and Performance	7	4310	December 12, 2019
Default Stream per Thread - Driver API CUDA Programming and Performance	2	2070	August 11, 2019

--default-stream per-thread question

Related topics