Obtain the Raw Data from the PM Sampling Timeline View

yet-another-user · September 16, 2025, 4:27pm

Hello,

I profiled some kernels using Nsight Compute. The PM sampling timeline view gives me the information I want to analyze for my kernel.
I would like to extract the data/list of points from the timeline view. In the raw data, I only see aggregates of this view, such as the sum. Yet the Nsight Compute tool is able to show them to me through time, at each t step (here, 1000 nanoseconds).

A screenshot of what I am referring to:

How can I extract the raw data visualized by Nsight Compute, either using the ncu cli or the Python Report Interface? I’ve been searching for this information for some time, and could not find anything online.

Thanks a lot for the help.

felix_dt · September 16, 2025, 5:50pm

PM Sampling metrics are stored as instanced metrics, with N pairs of <correlation ID : value>

You can

Use the metric details window to view and copy their values. Open the window and select any value on the timeline to view that metric (name and values).
Print metric instances from the command line with --print-metric-instances details
Use the Python Report Interface (PRI) to access instance correlation IDs (timestamps for sampling metrics) and values. You can find an example on how to access instances metric through Python here.

Note that in the timeline and metric details window, sampling metrics are generallyctx-switched and aligned to 0, unless context switch filter is disabled. In the CLI and PRI export, they are not (timestamps don’t start from 0).

yet-another-user · September 18, 2025, 3:29pm

Thank you! The Python notebook for the Python Report Interface helped a lot.

How is the data context switched and aligned to 0? The Workload Execution tab in yellow delimits the start and stop points of the kernel execution. How does NCU computes them? Is there a way to obtain them from the report?

I tried using the duration time of the kernel, but samples start being collected much before the beginning of the kernel execution.

felix_dt · September 29, 2025, 8:40am

The ctx switch information is stored in metrics starting with profiler__pmsampler_ctxsw.

Alignment to 0 simply means that no absolute timestamps are used, but only relative ones (relative to the first timestamp captured in a metric). Note that all metrics from the same replay pass (see metric profiler__pmsampler_pass_groups) implicitly use the same timestamps/are implicitly aligned.

Workload execution is not computed, it is measured/collected. You can check the metrics profiler__timestamp_workload_start,profiler__timestamp_workload_end

All these metrics are also documented in the Metrics Reference.

system · October 13, 2025, 8:41am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Timelline View of Using PM Sampling to Get Tenso Core Utilitation Nsight Compute	3	31	December 19, 2025
Which metrics can I see in the PM sampling timeline Nsight Compute	16	954	January 19, 2024
Time series data of metrics Nsight Compute	11	705	July 20, 2023
Can not find "The timeline row Workload Execution" in Nsight compute CUDA Programming and Performance	6	113	February 11, 2025
Can we use nsight compute / system (in command line) to get the sampled time or executed instructions (%) information inside a kernel? Nsight Compute	2	343	October 30, 2025
Can we extract the timestamp data for the SM active cycles in ncu report Nsight Compute	2	220	June 11, 2025
How to access (or compute) block durations and warp durations from raw data? Nsight Compute	14	825	May 11, 2020
Nsight Compute PM Sampling Nsight Compute	7	116	October 15, 2025
Does Nsight compute provide timeline chart when running a kernel? Nsight Compute	10	978	January 17, 2024
PM Sampling metrics description CUPTI – CUDA Profiler Tools Interface	4	424	June 5, 2025

Obtain the Raw Data from the PM Sampling Timeline View

Related topics