metrics of pgprof

Henrique_Renno · December 19, 2019, 5:49pm

Hello,

I have a question about some metrics that can be collected with pgprof. I want to know if the following equation holds:

inst_executed =
inst_executed_global_loads+inst_executed_local_loads+
inst_executed_shared_loads+inst_executed_surface_loads+
inst_executed_global_stores+inst_executed_local_stores+
inst_executed_shared_stores+inst_executed_surface_stores

Besides, how many bytes are moved by each instruction accumulated in inst_executed? 4 bytes?

Thanks

MatColgrove · December 23, 2019, 4:23pm

Hi Henrique,

Sorry for the late reply, I needed to check with the profiling team.

I want to know if the following equation holds

No. “inst_executed” counts all executed instructions, so it may include instructions that are not covered by the other metrics listed.
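
As a rough illustration, the Python sketch below sums the load/store counters from the question and compares the total with inst_executed. The counter values are made up for illustration and are not measured data, and the helper names memory_inst_total and unaccounted_inst are hypothetical. The remainder stands for whatever inst_executed counts beyond the listed metrics (for example arithmetic or control-flow instructions).

```python
# Load/store metric names taken from the question above.
MEMORY_METRICS = [
    "inst_executed_global_loads",
    "inst_executed_local_loads",
    "inst_executed_shared_loads",
    "inst_executed_surface_loads",
    "inst_executed_global_stores",
    "inst_executed_local_stores",
    "inst_executed_shared_stores",
    "inst_executed_surface_stores",
]


def memory_inst_total(counters):
    """Sum the load/store instruction counters present in `counters`."""
    return sum(counters.get(name, 0) for name in MEMORY_METRICS)


def unaccounted_inst(counters):
    """Instructions in inst_executed not covered by the load/store counters."""
    return counters["inst_executed"] - memory_inst_total(counters)


# Hypothetical values for one kernel (assumption, not a real profile).
example = {
    "inst_executed": 1000,
    "inst_executed_global_loads": 120,
    "inst_executed_global_stores": 40,
}

print(memory_inst_total(example))  # 160
print(unaccounted_inst(example))   # 840
```

If the equation held, the second value would be zero for every kernel.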

Besides, how many bytes are moved by each instruction accumulated in inst_executed? 4 bytes?

Not sure, nor am I sure that you can directly derive the number of bytes moved from the inst_executed metric, given that instructions are executed at the warp level.
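
To show why a fixed "4 bytes per instruction" does not follow from a warp-level count, here is a small Python sketch. It assumes a warp of 32 threads, and the function bytes_requested is a hypothetical helper, not part of any profiler. It only estimates the bytes requested by the threads; the actual memory traffic also depends on coalescing and caching, which this sketch ignores.

```python
WARP_SIZE = 32  # threads per warp (assumption: current NVIDIA GPUs)


def bytes_requested(warp_instructions, bytes_per_thread, active_threads=WARP_SIZE):
    """Estimate bytes requested by a number of warp-level memory instructions.

    Each warp-level instruction is executed by up to WARP_SIZE threads, and
    each active thread requests `bytes_per_thread` bytes.
    """
    if not 0 <= active_threads <= WARP_SIZE:
        raise ValueError("active_threads must be between 0 and WARP_SIZE")
    return warp_instructions * active_threads * bytes_per_thread


# One warp-level load of a 4-byte value with all 32 threads active.
print(bytes_requested(1, 4))        # 128
# One warp-level load of an 8-byte value with only 16 threads active.
print(bytes_requested(1, 8, 16))    # 128
# Ten warp-level loads of a 16-byte value with all threads active.
print(bytes_requested(10, 16))      # 5120
```

The same instruction count can therefore correspond to very different byte counts, depending on the access width and on how many threads in the warp are active.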

If you’re using a device with compute capability (CC) 7.0 or greater (Volta or newer), you may want to consider moving to Nsight Compute, which is the successor to nvprof/pgprof for collecting metrics. Nsight Systems is the replacement for the timeline.

https://docs.nvidia.com/nsight-compute/index.html#nsight-compute

-Mat
