CUDA Visual Profiler: Not showing overlapping memory copies

zenna · March 11, 2010, 4:13pm

This question has come up before on these forums but was unanswered:

The CUDA visual profiler does not show overlapping memory transfers and kernel executions in the GPU Time Width Plot. I have seen this in my own code and when running the simplestreams example in the SDK.

Is this by design or a bug?

How is one to determine how well they are overlapping memory transfers and computation?

Furthermore, in the visual profiler version 2.3, it shows all memory transfers in stream 0, which is just incorrect. Again this has been observed both in my own code and the simplestreams SDK example.

Thanks

Zenna

As a side note, in my opinion it would be great if a lot more of the experts here would participate in sites designed for QA such as stackoverflow. It is too easy for questions to slip under the radar and go unanswered in the forum format.

Topic		Replies	Views
concurrent copy and execution not showing in visual profiler CUDA Programming and Performance	0	3629	July 22, 2009
streams strange behaviour with profiler CUDA Programming and Performance	0	555	November 25, 2014
Visual profiler missing information CUDA Programming and Performance	6	8542	May 26, 2009
Visual Profiler: tracking of concurrent data transfers and kernel executions CUDA Programming and Performance	2	607	January 20, 2011
CUDA stream performance CUDA Programming and Performance	5	2498	July 23, 2013
CUDA Streams problem CUDA Programming and Performance	0	542	February 3, 2013
Maxwell. Overlapping data transfers CUDA Programming and Performance	6	1287	January 29, 2015
Visual Profiler displays erroneous output with multiple GPUs Profiler problem on multi-gpu scaling b CUDA Programming and Performance	0	842	May 9, 2012
Problems with Streams Very strange!!! CUDA Programming and Performance	1	7689	November 26, 2009
what does Visual Profiler mean regarding this analysis result? Kernel time+Memory copy time does not CUDA Programming and Performance	1	562	January 2, 2012

CUDA Visual Profiler: Not showing overlapping memory copies

Related topics