CUDA Streams: Start at the same time

asandip785 · November 12, 2021, 5:21pm

Hi,

I implemented streams in my CUDA script as shown.

   PT1<<<gride, blocke>>>(dvxdx, dvydy, dvxdy, dvydx, d_vx, d_vy, d_alpha, d_beta, d_index,nbe);
    cudaDeviceSynchronize();

    PT1_Etanbe<<<gride, blocke, 0, stream1>>>(Eta_nbe, d_etan, d_areas, nbe);


    PT1_x<<<gride, blocke, 0, stream2>>>(dvxdx, dvydy, dvxdy, dvydx, d_vx, d_vy, d_alpha, d_beta, d_index, kvx,  d_etan, d_Helem, d_areas, d_isice, nbe);

    PT1_y<<<gride, blocke, 0, stream3>>>(dvxdx, dvydy, dvxdy, dvydx, d_vx, d_vy, d_alpha, d_beta, d_index, kvy,  d_etan, d_Helem, d_areas, d_isice,  nbe);

I am looking to run the kernels in streams 1, 2 and 3 simultaneously. The qdrep file

shows that the kernels in those streams don’t begin at the same time and there is not much overlap in time. What am I missing?

Thanks for any information you can provide,
Anjali

Robert_Crovella · November 12, 2021, 5:37pm

This is a very common question. If each of your kernels fully occupy the GPU, there is no reason to expect overlap/concurrency.

asandip785 · November 12, 2021, 5:48pm

Thank you for your reply. Does that mean if I want to achieve full concurrency I would need to run the streams on multiple GPUs (each stream on a different GPU)?

Robert_Crovella · November 12, 2021, 5:55pm

That should work.

Topic		Replies	Views
Running kernels concurrently on parallel streams/same start time CUDA Programming and Performance	1	318	July 7, 2022
CUDA concurrency problem - multi-GPU vector add Visual Profiler and nvprof cuda , kernel	0	921	July 8, 2021
My streams are not running concurrently CUDA Programming and Performance	7	1775	March 6, 2018
Why streams cant run concurrently CUDA Programming and Performance	4	921	March 22, 2018
Cannot see concurrent kenrel execution by stream CUDA Programming and Performance	2	534	November 16, 2017
Kernel launch concurrency CUDA Programming and Performance	10	1803	December 11, 2014
Why kernel executions in different streams are not parallel? CUDA Programming and Performance	4	2655	April 29, 2019
How to Launch Cuda kernel in different processes CUDA Programming and Performance	8	3726	November 6, 2018
Distinct Kernels on Concurrent Streams? CUDA Programming and Performance	3	1210	June 9, 2009
Concurrent kernel execution CUDA Programming and Performance	2	328	March 26, 2024

CUDA Streams: Start at the same time

Related topics