how does nvprof collect events?

dongxiao · July 18, 2017, 8:24am

nvprof can report how many times specific events occur, but I'm not sure how it does that.
More specifically, does nvprof keep collecting information for the whole time the application runs?
Or does it only sample while the application is running, so that the event counts reported by nvprof are not equal to how many times those events actually happen?

Thanks a lot

veraj · August 14, 2017, 5:41am

Hi, dongxiao

nvprof collects events for each kernel in isolation, i.e. it serializes the kernels in the application so that events can be attributed to a specific kernel. This helps the user understand and analyze the optimization opportunities for each kernel separately. If the specified events/metrics can't all be collected in a single run of the kernel, nvprof by default replays each kernel multiple times until all the events/metrics are collected. The --replay-mode option can be used to change the replay mode. In "application replay" mode, nvprof re-runs the whole application instead of replaying each kernel, in order to collect all events/metrics.
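To make the kernel replay idea concrete, here is a toy model in Python. It is only a sketch and not how nvprof is implemented: the `counters_per_pass` limit, the `fake_kernel` function and the count values are all made up for illustration, and the real rules for grouping events depend on the GPU. What it shows is that the same kernel is run once per group of events, and the counts from the passes are merged at the end.

```python
from typing import Callable, Dict, List


def split_into_passes(events: List[str], counters_per_pass: int) -> List[List[str]]:
    """Group requested events so each group fits in one pass (assumed limit)."""
    if counters_per_pass < 1:
        raise ValueError("counters_per_pass must be at least 1")
    return [events[i:i + counters_per_pass]
            for i in range(0, len(events), counters_per_pass)]


def profile_with_replay(run_kernel: Callable[[List[str]], Dict[str, int]],
                        events: List[str],
                        counters_per_pass: int) -> Dict[str, int]:
    """Replay the kernel once per event group and merge the counts."""
    results: Dict[str, int] = {}
    for group in split_into_passes(events, counters_per_pass):
        # Each replay runs the whole kernel again with only this group enabled.
        results.update(run_kernel(group))
    return results


# Hypothetical counts for one kernel; the values are invented for this sketch.
TRUE_COUNTS = {"inst_executed": 1000, "warps_launched": 32, "branch": 75}
replays: List[List[str]] = []


def fake_kernel(enabled: List[str]) -> Dict[str, int]:
    """Stand-in for a kernel launch: records the pass and returns its counts."""
    replays.append(list(enabled))
    return {name: TRUE_COUNTS[name] for name in enabled}


collected = profile_with_replay(fake_kernel, list(TRUE_COUNTS), counters_per_pass=2)
print(len(replays), collected)
```

In this model three events with room for two per pass need two replays of the kernel. The merge only makes sense if every replay does the same work, which is why a kernel has to behave the same way on each pass for the combined numbers to be meaningful.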

When collecting events/metrics, nvprof profiles all kernels launched on all visible CUDA devices by default. The profiling scope can be limited to a specific context, stream, kernel or kernel invocation. More details about profiling scope can be found at Profiler :: CUDA Toolkit Documentation.
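As a rough illustration of limiting the scope, the Python helper below assembles an nvprof command line. The helper itself, `./my_app` and `my_kernel` are hypothetical; the flags (`--replay-mode`, `--devices`, `--kernels`, `--events`) and the `context:stream:kernel:invocation` filter form are taken from the nvprof documentation as I understand it, so please check them against the profiler guide for your toolkit version. The scope options are placed before `--events` because they apply to the event/metric options that follow them.

```python
import shlex
from typing import List, Optional


def nvprof_command(app: List[str],
                   events: List[str],
                   kernels: Optional[str] = None,
                   devices: Optional[str] = None,
                   replay_mode: Optional[str] = None) -> List[str]:
    """Build an nvprof argument list; scope options go before --events."""
    if replay_mode not in (None, "kernel", "application", "disabled"):
        raise ValueError(f"unknown replay mode: {replay_mode}")
    cmd = ["nvprof"]
    if replay_mode is not None:
        cmd += ["--replay-mode", replay_mode]
    if devices is not None:
        cmd += ["--devices", devices]
    if kernels is not None:
        # Filter form: context:stream:kernel:invocation (empty parts match any).
        cmd += ["--kernels", kernels]
    cmd += ["--events", ",".join(events)]
    return cmd + app


# Second invocation of a kernel whose name matches "my_kernel", on device 0,
# re-running the whole application instead of replaying each kernel.
cmd = nvprof_command(["./my_app"], ["warps_launched", "inst_executed"],
                     kernels="::my_kernel:2", devices="0",
                     replay_mode="application")
print(shlex.join(cmd))
```

Building the arguments as a list keeps quoting problems out of the way if the command is later passed to `subprocess.run`.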
