Are Lovelace GPU L2 caches partitioned like the Ampere ones?

epk · September 27, 2024, 7:39pm

I know that with Ampere, NVIDIA GPU’s L2 cache got partitioned in two, as described in this blog article:

What about Ada Lovelace? The L2 cache has grown further. Is it still partitioned into two parts, or has this changed (maybe into 4 partitions)?

Greg · September 28, 2024, 5:22am

The Ada architecture does not have a partitioned L2 like A100 and H100. Ada chips have a larger-capacity L2 cache; however, it is not capacity that requires the more recent 100-class chips to have partitioned L2s. It is the number of memory-client request and response ports that have to connect to L2 request and response ports.

epk · September 28, 2024, 10:12am

Ah, so I’m guessing GA102 isn’t partitioned either? And the Blackwell “Hopper successor” will be?

Robert_Crovella · September 28, 2024, 11:56am

Blackwell is partitioned in the sense that the high-end Blackwell datacenter GPU consists of 2 dies, and with a bit of searching you can find statements about that made by one of our VPs, Bryan Catanzaro. The technical overview doc has this to say:

This architecture is able to incorporate a significant amount of computing power by
merging two dies into a single, unified GPU. Each of the two dies are the largest die
possible within the limits of reticle size, as big as can possibly be built today. The two dies
are connected and unified with a single 10 terabyte-per-second (TB/s) chip-to-chip NVIDIA
High-Bandwidth Interface (NV-HBI), providing one fully coherent, unified GPU.

rs277 · September 28, 2024, 5:53pm

Possibly not, if the L2 cache representation in Nsight Compute is accurate. I could not find a post showing the 3090's memory.

Comparing the 4090 and the A100, the latter shows the bisection of the L2 block.
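
For anyone who wants to check what their own card reports, here is a minimal sketch that prints the L2-related fields of `cudaDeviceProp`. The file name `l2info.cu` is just a placeholder. Note the limitation: these fields give total L2 capacity only, and as far as I know the runtime properties do not say whether the L2 is partitioned, so this complements the Nsight Compute view rather than replacing it.

```cuda
// l2info.cu (hypothetical file name): print L2-related device properties.
// Build with: nvcc l2info.cu -o l2info
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int count = 0;
    cudaError_t err = cudaGetDeviceCount(&count);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "cudaGetDeviceCount failed: %s\n",
                     cudaGetErrorString(err));
        return 1;
    }

    const double mib = 1024.0 * 1024.0;
    for (int dev = 0; dev < count; ++dev) {
        cudaDeviceProp prop;
        err = cudaGetDeviceProperties(&prop, dev);
        if (err != cudaSuccess) {
            std::fprintf(stderr, "cudaGetDeviceProperties(%d) failed: %s\n",
                         dev, cudaGetErrorString(err));
            return 1;
        }
        // Total L2 capacity in bytes; says nothing about partitioning.
        std::printf("%d: %s (compute capability %d.%d)\n",
                    dev, prop.name, prop.major, prop.minor);
        std::printf("  L2 cache size:          %.1f MiB\n",
                    prop.l2CacheSize / mib);
        // Largest L2 set-aside available for persisting accesses (CUDA 11+).
        std::printf("  max persisting L2 size: %.1f MiB\n",
                    prop.persistingL2CacheMaxSize / mib);
        std::printf("  memory bus width:       %d bits\n",
                    prop.memoryBusWidth);
    }
    return 0;
}
```

Running it on a 4090 and an A100 side by side would show the capacity difference discussed above, but not the bisection of the L2 block.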
