Perfworks and Metrics Detail

morteza · January 16, 2020, 8:02pm

I am trying to process “nsight-cuprof-report” file and extract metric from it directly, but I have some problems. I understand that the Perfwork API is being used in Nsight Compute, and if I pass the metrics using --metrics, regardless of the order the output file has certain order under Section tag and which can be used to extract values from Metrics tags (starting from 157)!

However if I use “.section” file to specify the metrics, the order in the Section tag in Nsight Compute output files is based on section file, which is inconsistent with NameID of Metrics tag. Starting from 157, with certain order which I couldn’t find yet.

I was hoping to use the “–query-metrics” output order to solve this problem and I encountered another problem, some metrics (such as “l1tex__t_sector_hit_rate”) which are available in the documentation and default section files are not shown in “–query-metrics” for my GPU (Titan RTX).

I would appreciate any hint or solution to these.

Sanjiv.Satoor · January 17, 2020, 5:36am

You can refer the “Report File Format” section in the Nsight Compute Customization Guide:
https://docs.nvidia.com/nsight-compute/CustomizationGuide/index.html#report-file-format

Also you can consider using the --csv option to get comma separated output - in case that is easier for you to process.

morteza · January 17, 2020, 5:30pm

Thanks! I can (almost) successfully decode the report file, but I cannot figure out the payload types, so I just use “ProfileResult” to decode, it decodes all metrics data I want, but large portion of report file cannot be decoded using this. I tried all other proto formats, no luck yet. For example the first payload is around 90% of report file and I could not decode it!

I am guessing they are sorted alphabetically, however that’s my guess and it seems correct for now. But if I pass an invalid metrics, “nv-nsight-cu” (the GUI) can identify it and will show yellow triangle, however I couldn’t find where it is stored in the report file. So one wrong metric and my code will produce wrong values. For now I can say that metrics up to 156 and last 10 are reserved, but this might be wrong!

Also, thanks for the csv suggestion, but I need nvtx data and it’s not supported in csv form.

felix_dt · January 20, 2020, 8:12am

The payloads per block are in the number and order as described by the BlockHeader structure, i.e. a BlockHeader with

NumSources=1,NumResults=2,SessionDetails=null,StringTable=<data>,PayloadSize=<size>,Process=<data>

starts with one payload of type SourceData, followed by two payloads of type ProfileResult, all three having a combined size of .
SourceData records are described in the ProfilerCommon.proto file. They contain the CUDA modules of your kernels, which is why they can make up considerable parts of the report.

Metrics are not stored necessarily alphabetically. Each metric is assigned a unique numerical ID, depending on the order they are encountered during processing. The IDs can be resolved to metric names using the StringTable entries in the BlockHeaders. StringTables can be split across multiple blocks, but the joined table up to block N is guaranteed to contain the IDs for all metrics encountered in this and all preceding blocks.

Topic		Replies	Views
Seeing n/a for metrics Nsight Compute	1	629	September 23, 2019
Why get all metrics with "n/a" in Nsight? Nsight Compute	5	1271	June 6, 2019
Nv-nsight-cu-cli --metrics gpu__time_active ./program show n/a data Nsight Compute cuda	1	959	June 3, 2020
Metrics Reference for 7.5 Nsight Compute	1	589	January 28, 2019
n/a for metrics Nsight Compute	8	1813	December 26, 2019
Can't Get NCU GUI To Import Properly Nsight Compute	8	1601	October 5, 2020
Nvprof metrics in nsight? Nsight Compute	1	967	June 3, 2021
How do i get some of the nvprof metrics in insight? Nsight Compute	0	790	June 2, 2021
NVIDIA® Nsight™ Compute 2021.2 is now available Nsight Compute	0	786	June 28, 2021
Get Nvprof-like information by Nsight Nsight Compute	5	790	June 27, 2023

Perfworks and Metrics Detail

Related topics