Question about PM sampling

FlyK · November 2, 2023, 2:48am

I have some questions about the PM sampling feature in the latest version of ncu, so I would like to understand its specific meaning. Could you please provide clarification?

I would like to confirm the valid range of the pm-sampling-interval, as it is not explicitly mentioned on the official website. Could you please provide this information?

image1920×545 70.9 KB

2.I am not clear about the meaning of ‘pass group’ and ‘pass group X active’ in PM sampling. Could you please provide an explanation?

image1920×1306 200 KB

Thanks.

jmarusarz · November 2, 2023, 8:06pm

I’m not sure about the interval. I’ll do some digging and get back to you.

With respect to the pass groups, not all metrics can be collected at the same time so they are grouped into multiple passes. Pass groups are define which metrics are collected together. If you hover over another row, it should tell you what group it was a member of during the collection. At the end, all groups are then composed onto the same timeline.

FlyK · November 3, 2023, 1:47am

Thank you very much, your response has been very helpful in helping me understand the concept of ‘pass group’.

Please allow me to ask two more questions.

3.I don’t quite understand the meaning of ‘context switch trace’ and how it helps us analyze specific issues in the pass group. Could you please provide an explanation?

4.Does PM sampling support collection under MIG and Virtual Function in ncu?

Thanks.

felix_dt · November 3, 2023, 8:15am

I don’t quite understand the meaning of ‘context switch trace’ and how it helps us analyze specific issues in the pass group.

Context switch trace is explained in the documentation. The purpose is to align the data sampled across multiple passes and to filter it to only the CUDA context that is being profiled.

Does PM sampling support collection under MIG and Virtual Function in ncu?

PM sampling is not supported on vGPU (assuming that’s what you mean by “virtual function”). It is supported on MIG (but context switch trace is not supported for it, making the collected data slightly harder to interpret).

I would like to confirm the valid range of the pm-sampling-interval

The minimal interval for sampling depends on the GPU architecture. For Turing and GA100, it is 20000 cycles. For GA10x and newer, it is 1000ns.

FlyK · November 3, 2023, 2:36pm

Thank you for your response, it has been very helpful for me to understand PM sampling.

Topic		Replies	Views
Which metrics can I see in the PM sampling timeline Nsight Compute	16	775	January 19, 2024
PM sampling doesn't work Nsight Compute	10	845	December 5, 2023
How to utilize PM sampling? Nsight Compute	2	605	April 26, 2024
Question about PC sampling Nsight Compute	3	506	December 20, 2023
What exactly does SM Active Cycles mean? Nsight Compute	3	628	July 30, 2024
How to get the bytes read/write sum about Memory access between GPUs? Nsight Compute	7	881	March 20, 2024
Metric references and description Nsight Compute	7	4211	March 2, 2024
How can I use PmSamling with ncu? Nsight Compute	2	365	June 28, 2024
Time series data of metrics Nsight Compute	11	496	July 20, 2023
How to get real-time SM occupancy Nsight Visual Studio Edition pytorch	1	2048	August 16, 2022

Question about PM sampling

Related topics