Turing L2 cache

mahmood.nt · September 26, 2020, 9:04pm

The deviceQuery for 2080Ti says

(68) Multiprocessors, ( 64) CUDA Cores/MP:     4352 CUDA Cores
L2 Cache Size:                                 5767168 bytes

Considering the fact that L2 is shared among all SMs, 5767168/68=84811.2941 which is not a power of 2 number. Usually, the number of sets, ways and block size are power of 2. For that number, we can estimate (S=41)(W=16)(B=128) which yields 83,968‬ bytes or (S=44)(W=15)(B=128) which yields 84,480 bytes.

Besides microbenchmarking, I am curious to know if there are more information about that.

Robert_Crovella · October 16, 2020, 1:42am

Here are some hints:

https://www.nvidia.com/content/dam/en-zz/Solutions/design-visualization/technologies/turing-architecture/NVIDIA-Turing-Architecture-Whitepaper.pdf
5632KB /512KB = 11
The TU102 die has 12 memory controllers (each corresponding to 32 bits of bus width), but the 2080Ti only has a 352-bit memory bus width.

Topic		Replies	Views
L2 size per SM CUDA Programming and Performance	2	520	October 12, 2021
L2cache size of A800 80GB CUDA Programming and Performance	3	906	April 17, 2024
Actual L1 size in Volta and Turing CUDA Programming and Performance	5	1732	December 29, 2019
Jetson TX2 Cache Line Size Jetson TX2	10	2618	October 18, 2021
Cache size effect CUDA Programming and Performance cuda	4	120	March 17, 2025
L2 Texture Cache CUDA Programming and Performance	10	3348	July 5, 2010
Maximum size for persisting L2 cache CUDA Programming and Performance cuda	4	920	October 25, 2022
Where can I easily find the L1 and L2 cache line size per compute capability? CUDA Programming and Performance	1	395	July 2, 2024
Memory transaction size CUDA Programming and Performance	1	1797	February 12, 2017
Questions about L2 texture cache CUDA Programming and Performance	1	6990	December 19, 2007

Turing L2 cache

Related topics