I have a question about the DRAM sectors metric, specifically dram__sectors.sum. My understanding is that this value should represent the number of L2 cache misses.
However, when I try to calculate it by multiplying the total number of L2 sectors (lts__t_sectors.sum) by the L2 miss rate (1 - lts__t_sector_hit_rate.pct), the result doesn’t match the value reported by dram__sectors.sum.
Here’s an example from a GEMM application:
lts__t_sector_hit_rate.pct (%) 99.37
lts__t_sectors.sum (sector) 27,430,992
dram__sectors.sum (sector) 98,312
Shouldn’t dram__sectors.sum be equal to lts__t_sectors.sum * (1 - lts__t_sector_hit_rate.pct/100)?
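Plugging the numbers above into that formula (a quick sanity check, using the values from the GEMM run) gives an estimate well above what the profiler reports:

```python
# Naive estimate of DRAM sectors from the L2 metrics in the GEMM run above.
lts_t_sectors = 27_430_992       # lts__t_sectors.sum
l2_hit_rate = 99.37 / 100        # lts__t_sector_hit_rate.pct
dram_sectors_reported = 98_312   # dram__sectors.sum

naive_miss_sectors = lts_t_sectors * (1 - l2_hit_rate)
print(round(naive_miss_sectors))  # ~172,815, vs. the reported 98,312
```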
Why is there a discrepancy between these values? Am I misunderstanding how these metrics are defined?
dram__sectors.sum includes all read and write sectors at the memory controller.
lts__t_sectors.sum counts sector lookups at the L2 tag stage, each of which can hit or miss; it measures accesses to the L2 data RAM, not traffic to DRAM.
lts__t_sectors_lookup_miss.sum is not equivalent to dram__sectors.sum.
Here are some considerations on the differences:
Requests to L2 that miss can result in both fill sectors (reads from DRAM) and evict write-back sectors if the selected victim line is dirty.
Writes (hit or miss) may generate write-through sectors to DRAM. This is not predictable: since L2 is the point of coherence, a write-through is not always required for device memory.
Compressed reads/writes can result in additional DRAM traffic (on a hit or a miss).
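The considerations above can be summarized in a toy accounting sketch. This is purely illustrative (the component names and numbers are hypothetical, not real counters): dram__sectors.sum aggregates several traffic sources, of which only the fill component corresponds to the naive "L2 miss" count.

```python
# Toy accounting model of sector traffic at the memory controller.
# All component values are hypothetical; only `fill` corresponds to the
# naive L2-miss estimate, yet dram__sectors.sum counts all of them.

def dram_sector_estimate(fill, dirty_evict, write_through, compression_extra):
    # fill: sectors read from DRAM to service L2 misses
    # dirty_evict: write-back sectors from evicting dirty L2 lines
    # write_through: write-through sectors generated by writes (hit or miss)
    # compression_extra: additional traffic from compressed reads/writes
    return fill + dirty_evict + write_through + compression_extra

# The same fill count can coincide with very different DRAM totals:
print(dram_sector_estimate(fill=100_000, dirty_evict=0,
                           write_through=0, compression_extra=0))
print(dram_sector_estimate(fill=100_000, dirty_evict=40_000,
                           write_through=25_000, compression_extra=5_000))
```

The point of the sketch is that no formula built only from lts__t_sectors.sum and the hit rate can recover dram__sectors.sum, because the other components are workload- and hardware-dependent.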