Why is there a period of idle time between kernels or transfers, and what happens during this idle time?

I used nvprof to collect information such as startTime, Duration, and KernelName while training a model, using this command:

nvprof --csv --log-file log.csv --print-gpu-trace python test1.py

The result is as follows:

I find there is idle time between some transfers; for example, one transfer's start time plus its duration is less than the start time of the next transfer:

8.238898s + 0.264026s = 8.502924s < 8.534692s

There is also idle time between some kernels; for example:

8.602683s + 0.0217733s = 8.6244563s < 8.624457s

Usually the idle time between transfers is longer than the idle time between kernels, and I want to know what happens during this idle time. In addition, there is an entry named [CUDA memcpy DtoD]; what is that operation doing?

Any response would be greatly appreciated!

Hi, @lylyly6666

Sorry for the late response!
It would be good if you could move to Nsight Systems instead of nvprof.
It is easier to look at the timeline in Nsight Systems to identify why there are gaps between kernels or memory transfers.
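For example, a typical Nsight Systems invocation would be (the report name here is just a placeholder):

nsys profile --trace=cuda,nvtx -o report python test1.py

This produces a report file (.nsys-rep, or .qdrep on older versions) that you can open in the Nsight Systems GUI to inspect the timeline.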
These gaps could be due to various reasons, such as:
a) overhead of the CUDA APIs
b) other processing in the application code between the CUDA calls (see the NVTX sketch below)
c) profiler overhead
d) some synchronization
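To check for (b), you can annotate your script with NVTX ranges so that application-side work shows up as named regions on the Nsight Systems timeline, right next to the CUDA activity. A minimal sketch, assuming you are training with PyTorch (model and data_iter are hypothetical placeholders for your own objects):

import torch

# Wrap application-side work in NVTX ranges so it appears as a named
# region on the Nsight Systems timeline.
torch.cuda.nvtx.range_push("data_loading")
batch = next(data_iter)    # hypothetical: your input pipeline
torch.cuda.nvtx.range_pop()

torch.cuda.nvtx.range_push("forward")
output = model(batch)      # hypothetical: your model's forward pass
torch.cuda.nvtx.range_pop()

Regarding [CUDA memcpy DtoD]: that entry is a device-to-device copy, i.e. data moved from one GPU memory location to another without going through the host (it corresponds to cudaMemcpy/cudaMemcpyAsync calls with the cudaMemcpyDeviceToDevice kind). Frameworks issue these internally, for example when duplicating tensors. A rough sketch of the kind of PyTorch code that typically shows up as a DtoD copy in the trace:

import torch

x = torch.randn(1024, 1024, device="cuda")
# Copying a contiguous GPU tensor into fresh GPU memory is typically
# recorded in the trace as [CUDA memcpy DtoD].
y = x.clone()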
