NSIGHT equivalent for branch_efficiency in nvprof

rajprasannap · September 23, 2020, 6:45am

In nvprof, the metric used to determine branch efficiency (warp divergence) is branch_efficiency. nvprof has been deprecated for devices with compute capability >= 7.2 and NSIGHT Compute is used instead. What is the equivalent to branch_efficiency in NSIGHT?

Thanks.

felix_dt · September 24, 2020, 6:22am

As indicated in the nvprof transition guide Nsight Compute CLI :: Nsight Compute Documentation, branch_efficieny is not directly available in Nsight Compute at this point. The team is looking into providing a matching mapping in a future release.

In the meantime, please check if any of the following related metrics is useful for your case:

smsp__average_warp_latency_issue_stalled_branch_resolving
average number of warp cycles spent waiting for a branch target address to be computed, and the warp PC to be updated

smsp__average_warps_issue_stalled_branch_resolving_per_issue_active
average number of warps resident per issue cycle, waiting for a branch target address to be computed, and the warp PC to be updated

smsp__inst_executed_op_branch
number of warp instructions executed: BRA, BRX, JMP, JMX, CALL, RET here description needs to include YIELD, EXIT, WARPSYNC … etc instruction in description

smsp__warp_issue_stalled_branch_resolving_per_warp_active
proportion of warps per cycle, waiting for a branch target address to be computed, and the warp PC to be updated

smsp__warps_issue_stalled_branch_resolving
cumulative number of warps waiting for a branch target address to be computed, and the warp PC to be updated

Note that you might need to add a valid suffix to the base name when collecting the metric in Nsight Compute

Topic		Replies	Views
Nvprof metrics in nsight? Nsight Compute	1	965	June 3, 2021
The results of nsight compute metrics are almost all 0 Nsight Compute cuda	1	607	September 28, 2023
nvprof --metrics branch_efficiency..... Why no metrics ? Visual Profiler and nvprof	3	1803	December 14, 2019
Using command "nsys nvprof" to measure warp_execution_efficiency and other metrics Profiling Linux Targets nsight	1	616	August 27, 2021
How do i get some of the nvprof metrics in insight? Nsight Compute	0	785	June 2, 2021
Get different metric results using nsight compute and nvprof Nsight Compute nvbugs	3	1513	October 19, 2023
How to get nvprof equivalent of nvprof metrics --query-metrics Nsight Compute	4	322	November 27, 2024
scope of nvprof metric Visual Profiler and nvprof	0	484	October 18, 2019
Shared memory efficiency in Nsight Nsight Compute	1	663	November 24, 2019
I found ncu's branch efficiency metric is always zero for any kernels Nsight Compute cuda	4	476	October 25, 2024

NSIGHT equivalent for branch_efficiency in nvprof

Related topics