Why cuda kernel computation cannot overlap with CPU to GPU data transfer?

shunkangz · May 21, 2024, 6:19am

I tried to prefetch some data before the computation start. It is strange that sometimes the computation and CPU to GPU transfer can overlap well, but sometimes it just execute in sequence. According to nsight system results, we can observe that the kernel execution time between 10829 and 10830 is around 9ms. However, there is no dependency between the data copy and computation.

I just repeat the similar process for a few times. At another time point, the two can overlap very well. The code is exact the same. May I know what the problem cause this?

Robert_Crovella · May 21, 2024, 2:11pm

Is there any change in behavior if you profile your code specifying

CUDA_MODULE_LOADING=EAGER nsys profile ...

?

Topic		Replies	Views
Is it possible to overlap memory access and computation inside the same kernel? CUDA Programming and Performance	5	1221	September 30, 2022
Strange behavior with overlap of transfer and compute CUDA Programming and Performance	4	4014	October 19, 2011
No overlap between communication and computation across CUDA streams in PyTorch CUDA Programming and Performance	1	67	January 15, 2026
Overlapping kernel execution and data transfer CUDA Programming and Performance	9	3600	May 10, 2017
problems with using streams when overlapping transfer and kernel execution CUDA Programming and Performance	0	980	November 11, 2009
cpu function and gpu kernel overlap CUDA Programming and Performance	12	1552	July 25, 2017
Bug when overlapping tranfert & data CUDA Programming and Performance	1	609	February 11, 2011
Can I use blocks instaed of streams to overlap data transfer with compute? CUDA Programming and Performance cuda , kernel	2	389	October 12, 2021
Data transfers are not overlapping CUDA Programming and Performance	2	678	February 7, 2018
Overlapping kernel execution and memory copy CUDA Programming and Performance	6	9822	September 22, 2007

Why cuda kernel computation cannot overlap with CPU to GPU data transfer?

Related topics