I want to measure the peak read and write (load/store) bandwidth separately for device memory during my kernel executions. Does Nsight have any metric that could capture that?
If not, is there another tool I could use to obtain it? I know NVML provides this functionality for data across PCIe, but I need it for GPU memory.
I would suggest starting by collecting the MemoryWorkloadAnalysis sections, either separately or together with any other metrics and/or sections you are interested in. This should give you several tables and charts in the UI when opening the result as a report.
nv-nsight-cu-cli --section "MemoryWorkloadAnalysis.*" (app)
If you only want to collect individual metrics, you can start with
nv-nsight-cu-cli --metrics dram__bytes_write.sum,dram__bytes_read.sum,dram__bytes_write.sum.pct_of_peak_sustained_elapsed,dram__bytes_read.sum.pct_of_peak_sustained_elapsed (app)
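Note that dram__bytes_read.sum and dram__bytes_write.sum report total bytes, not rates. To get an absolute bandwidth figure, you can additionally collect the kernel duration (e.g. the gpu__time_duration.sum metric) and divide. A minimal post-processing sketch in Python, assuming you exported the results as CSV with --csv; the column names and units below are illustrative and may differ from your actual output:

```python
import csv
import io

# Illustrative CSV export from the profiler (--csv); adjust the column
# names and units to match your actual output.
sample = """Metric Name,Metric Unit,Metric Value
dram__bytes_read.sum,Mbyte,512.00
dram__bytes_write.sum,Mbyte,256.00
gpu__time_duration.sum,msecond,2.00
"""

def bandwidth_gb_s(csv_text):
    """Compute read/write DRAM bandwidth in GB/s from byte totals and duration."""
    values = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        values[row["Metric Name"]] = float(row["Metric Value"])
    seconds = values["gpu__time_duration.sum"] / 1e3   # msecond -> second
    read_gb = values["dram__bytes_read.sum"] / 1e3     # Mbyte -> GB
    write_gb = values["dram__bytes_write.sum"] / 1e3
    return read_gb / seconds, write_gb / seconds

read_bw, write_bw = bandwidth_gb_s(sample)
print(f"read: {read_bw:.1f} GB/s, write: {write_bw:.1f} GB/s")
```

The .pct_of_peak_sustained_elapsed variants already give you the same information as a percentage of the device's sustained peak, so the division is only needed if you want the value in GB/s.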
See https://docs.nvidia.com/nsight-compute/NsightCompute/index.html#sections-and-rules for the list of available sections. The currently active set is also available via --list-sections, or in the Sections/Rules Info window in the UI.
See the --query-metrics and --query-metrics-mode command line options in https://docs.nvidia.com/nsight-compute/NsightComputeCli/index.html#command-line-options-profile for how to query individual metric names.