What is the meaning of `gpu__compute_memory_access_throughput`?

lsysee · August 20, 2019, 2:34pm

The section file for --section MemoryWorkloadAnalysis contains this entry:

Metrics {
  Label: "Max Bandwidth"
  Name: "gpu__compute_memory_request_utilization_pct"
  Options {
    Name: "gpu__compute_memory_request_throughput.avg.pct_of_peak_sustained_elapsed"
    Filter {
      MinArch: TU10x
    }
  }
}

What does gpu__compute_memory_access_throughput mean, and how can I get memory load/store efficiency using nv-nsight-cu-cli for sm_75?
Thanks in advance.

felix_dt · August 21, 2019, 11:29am

The section file you pasted does not contain the metric gpu__compute_memory_access_throughput. Did you paste the wrong name by any chance? In general, you should be able to get descriptions for almost all metrics using the command line query functionality, e.g. for SM 75 (RTX 2080 Ti):

nv-nsight-cu-cli --query-metrics --chip tu102

This will print a (long) list of all metric base names with their descriptions. You can check Nsight Compute CLI :: Nsight Compute Documentation for more details on how to use that option, and how to query the valid suffixes for those metric names on Volta or newer architectures.

If you already know the metric (base) name to query, e.g. gpu__compute_memory_access_throughput, you can also query it directly:

nv-nsight-cu-cli --query-metrics --chip tu102 --metrics gpu__compute_memory_access_throughput

For the metrics you listed, the descriptions are

gpu__compute_memory_access_throughput: the average Compute Memory Pipeline : throughput of internal activity within caches and DRAM, as a % of peak burst rate over active cycles
gpu__compute_memory_request_throughput: the average Compute Memory Pipeline : throughput of interconnects between SM<->Caches<->DRAM, as a % of peak burst rate over active cycles

Note that there are currently no direct mappings for most nvprof *_efficiency metrics in Nsight Compute (see also Nsight Compute CLI :: Nsight Compute Documentation).
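Since the full listing is long, it can help to pipe it through a filter. The following is a minimal sketch, assuming each metric is printed on its own line together with its name. `filter_metrics` is a hypothetical helper written for illustration, not part of Nsight Compute.

```shell
# Hypothetical helper (not part of Nsight Compute): keep only the lines of a
# metric listing, read from stdin, that contain the given fixed string.
filter_metrics() {
    grep -F -- "$1" || true   # "|| true": an empty result is not treated as an error
}

# Intended use, assuming nv-nsight-cu-cli is on PATH:
#   nv-nsight-cu-cli --query-metrics --chip tu102 | filter_metrics gpu__compute_memory
```

Matching on a fixed substring rather than a regular expression avoids surprises with characters in metric names, and it returns both the access and the request throughput entries when given the common prefix.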

lsysee · August 21, 2019, 3:08pm

Thanks for your detailed answer.
By the way, how can I get the descriptions of other metrics? And can I get *_efficiency by combining some metrics?

lsysee · August 22, 2019, 1:01pm

Thanks for your detailed answer.
By the way, how can I get the descriptions of other metrics? And can I get *_efficiency by combining some metrics?

felix_dt · August 22, 2019, 1:36pm

how can i get the descriptions of other metrics

nv-nsight-cu-cli --query-metrics
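On the question of combining metrics: nvprof defined its load/store efficiency metrics as the ratio of requested to required memory throughput, expressed as a percentage. The sketch below shows only that arithmetic. It assumes you have already obtained two byte counts from some source; `efficiency_pct` and both inputs are hypothetical, and no specific Nsight Compute metric names are implied, since the thread states there is no direct mapping.

```shell
# Hypothetical helper: efficiency as a percentage, given
#   $1 = bytes requested by the kernel
#   $2 = bytes actually transferred by the memory system
# Both values are assumed inputs; this thread does not say which
# Nsight Compute metrics (if any) would supply them.
efficiency_pct() {
    awk -v req="$1" -v xfer="$2" 'BEGIN {
        if (xfer == 0) { print "n/a"; exit }   # avoid division by zero
        printf "%.1f\n", 100 * req / xfer
    }'
}

# Example: 1024 bytes requested, 4096 bytes transferred
efficiency_pct 1024 4096
```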
