Unexplained gap in profiling multi-GPU timeline

CudaaduC · October 5, 2017, 11:35pm

Profiled via NVVP an application which uses streams and 2 GPUs and noticed a gap in one of the timelines which seems to be unexplained (the arrow points to the gap);

[url]https://imgur.com/a/KAxO0[/url]

This is a rather complicated process which involves a combination of host-to-device copies, custom kernels, multiple calls to the cufft library and device-to-host copies. This is all using streams and splitting the problem evenly between the two GPUs.

While I am happy with the overall amount of overlap between compute and bi-directional copies I wonder about that that gap for GPU #0 (which happens to be the GPU connected to the display).

Any ideas or ways to find out?

CUDA 8.0
Windows 8.1 x64
NVVP (claims to be using CUDA 9.0 but I compile against CUDA 8.0)
2x GTX 1080TI

mahmood.nt · October 27, 2020, 6:56pm

Hi
I also see such gaps in the profiler’s output Pasteboard - Uploaded Image
I don’t know if that means the GPU is idle in the gaps. If yes, why?

Topic		Replies	Views
strange GPU idle time in profiler CUDA Programming and Performance	4	1023	June 27, 2011
streams strange behaviour with profiler CUDA Programming and Performance	0	532	November 25, 2014
Bugs in the profiler 1.0? CUDA Programming and Performance	2	3123	September 6, 2008
Visual Profiler displays erroneous output with multiple GPUs Profiler problem on multi-gpu scaling b CUDA Programming and Performance	0	801	May 9, 2012
GPU timestamp for Concurrent CPU and GPU execution Compute Visual Profiler Question CUDA Programming and Performance	1	1000	November 24, 2010
Problems with Streams Very strange!!! CUDA Programming and Performance	1	7648	November 26, 2009
concurrent copy and execution not showing in visual profiler CUDA Programming and Performance	0	3603	July 22, 2009
Massive idle time CUDA Programming and Performance	3	1248	March 9, 2011
Profiler Times just need some info CUDA Programming and Performance	4	4555	June 16, 2010
Profiler Interpretation of profiler results CUDA Programming and Performance	3	5887	July 3, 2007

Unexplained gap in profiling multi-GPU timeline

Related topics