nvprof: timelines for GPU metrics values. --metrics and --print-gpu-trace options.

pyotr777 · January 18, 2018, 1:04pm

When you run nvprof with the --print-gpu-trace and --csv options, you get a nice table with timestamps and values for some metrics:

"Start","Duration","Grid X","Grid Y","Grid Z","Block X","Block Y","Block Z","Registers Per Thread","Static SMem","Dynamic SMem","Size","Throughput","SrcMemType","DstMemType","Device","Context","Stream","Name"
s,ms,,,,,,,,KB,KB,MB,GB/s,,,,,,
0.298472,0.001952,,,,,,,,,,0.001953,0.977125,"Pinned","Device","Tesla M60 (0)","1","7","[CUDA memcpy HtoD]"
0.298653,0.001408,,,,,,,,,,0.001953,1.354651,"Pinned","Device","Tesla M60 (0)","1","7","[CUDA memcpy HtoD]"

This format is very convenient for plotting timelines and analyzing application behavior.
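As a rough illustration of why this format is handy, here is a minimal Python sketch that turns such a CSV into (timestamp, value) pairs ready for plotting. The function names `read_gpu_trace` and `timeline` are made up for this example, and skipping lines that start with "==" (nvprof's own status messages) is an assumption about how the log was captured. The sample rows are the ones shown above.

```python
import csv
import io

# Sample copied from the --print-gpu-trace --csv output shown above.
SAMPLE = '''"Start","Duration","Grid X","Grid Y","Grid Z","Block X","Block Y","Block Z","Registers Per Thread","Static SMem","Dynamic SMem","Size","Throughput","SrcMemType","DstMemType","Device","Context","Stream","Name"
s,ms,,,,,,,,KB,KB,MB,GB/s,,,,,,
0.298472,0.001952,,,,,,,,,,0.001953,0.977125,"Pinned","Device","Tesla M60 (0)","1","7","[CUDA memcpy HtoD]"
0.298653,0.001408,,,,,,,,,,0.001953,1.354651,"Pinned","Device","Tesla M60 (0)","1","7","[CUDA memcpy HtoD]"
'''


def read_gpu_trace(text):
    """Parse the CSV text into (units, rows).

    units maps each column name to its unit string (second CSV line);
    rows is a list of dicts, one per traced operation.
    """
    # Assumption: nvprof status lines start with "==" and are not CSV data.
    lines = [ln for ln in text.splitlines() if ln and not ln.startswith("==")]
    reader = csv.reader(io.StringIO("\n".join(lines)))
    header = next(reader)
    units = dict(zip(header, next(reader)))
    rows = [dict(zip(header, rec)) for rec in reader if len(rec) == len(header)]
    return units, rows


def timeline(rows, column):
    """Return (start, value) pairs for one column, skipping empty cells."""
    points = []
    for row in rows:
        if row[column] != "":
            points.append((float(row["Start"]), float(row[column])))
    return points


units, rows = read_gpu_trace(SAMPLE)
throughput = timeline(rows, "Throughput")
```

The resulting `throughput` list can be passed to any plotting library; `units["Throughput"]` gives the axis label unit. Note that the units in the second line can differ between runs, so it is safer to read them from the file than to hard-code them.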

However, it only includes a few memory-related metrics. If you need other metrics and add the --metrics option, the nvprof output format changes: it no longer includes timestamps.

How can I get timestamps together with other metrics?

veraj · January 22, 2018, 2:30am

Hi, pyotr777

Have you tried using nvvp (the Visual Profiler) yet?

You can collect metrics from the UI and check “GPU Details”; the timestamps are shown there.

You can also export the data as CSV.

Hope this helps.

pyotr777 · January 22, 2018, 4:52am

Thank you!

I have tried nvvp, but so far I couldn’t get a timeline of metrics. I have been running nvprof with the -o option and opening the saved profile in Visual Profiler.
There is a timeline there, but it only has kernel invocations and memory operations. I don’t even see FLOP counters or any other metrics besides duration for kernels.

I tried running nvprof with the --analysis-metrics option. Unfortunately, it stops with an error:
cupy.cuda.memory.OutOfMemoryError: out of memory to allocate 134217728 bytes (total 2571250176 bytes)
The partial file that nvprof creates does contain more metrics for kernels, but nvprof fails this way every time, and I still cannot see metrics on a timeline.

What do you mean by “collect metrics from the UI”? I know there is a way to run remote profiling with nvprof from Visual Profiler. Are there any other profiling methods?

What I need is a metrics timeline like the graph below. It was plotted from a CSV file like the one I mentioned in the first post.

veraj · January 22, 2018, 5:08am

After the timeline has been generated, select Run->Configure Metrics and Events to choose any metrics you are interested in, then click Apply and Run.

All metric values are then listed in GPU Details.

pyotr777 · January 22, 2018, 7:42am

I have nvvp installed on my local computer. I created a new session with a remote connection, but now Visual Profiler cannot connect.

Failed to connect sshd on "EC2-52-91-16-22.COMPUTE-1.AMAZONAWS.COM:22"
Failed to connect sshd on "EC2-52-91-16-22.COMPUTE-1.AMAZONAWS.COM:22"
Failed to connect sshd on "EC2-52-91-16-22.COMPUTE-1.AMAZONAWS.COM:22"

Anyway, can you tell me whether there is a difference between running remote profiling from Visual Profiler and running nvprof on the remote machine from the CLI?
