How to utilize L2 partition?

202476410arsmart · November 23, 2022, 10:36am

Like shown here:

Ampere’s L2 cache is partitioned into two parts. If we can store the data near the corresponding L2, we can save time to move data! But how can we do that? Any example? Thank you!!!

rs277 · November 23, 2022, 6:05pm

There are some details in the Programming Guide here.

202476410arsmart · December 30, 2022, 6:04am

Thanks! I see how to use L2, but no how to find the “nearer corresponding L2”…

rs277 · December 30, 2022, 6:54am

I see now, in the “Dissecting…” reference above, that I misunderstood the information you were looking for.

I can’t help with your query, but I do see that the link I gave above to the Programming Guide section on L2 Access Management, is now broken, due to the new documentation layout.

As I can no longer edit the previous post, the current location for this is here.

jimmy.hj · January 18, 2023, 7:46am

I think you need aware the algorithm implementation, conclude the mapping between thread index and ld/st memory address pattern and generate the lookup table to map the virtual thread index and cuda level thread index.

Topic		Replies	Views
Specifying L2 cache partition for SM CUDA Programming and Performance	1	67	March 14, 2025
Are Lovelace GPU L2 caches partitioned like the Ampere ones? CUDA Programming and Performance	4	143	September 28, 2024
Is it possible to partition l2 cache? CUDA Programming and Performance	2	36	March 13, 2025
Learn about NVIDIA's Latest Announcements at Our Upcoming Webinars CUDA Setup and Installation	0	403	May 14, 2020
Learn about NVIDIA's Latest Announcements at Our Upcoming Webinars GPU-Accelerated Libraries	0	383	May 14, 2020
Learn about NVIDIA's Latest Announcements at Our Upcoming Webinars CUDA Developer Tools	0	320	May 14, 2020
Use of L2 cache CUDA Programming and Performance	13	277	March 26, 2025
Learn about NVIDIA's Latest Announcements at Our Upcoming Webinars CUDA	0	304	May 14, 2020
A100 L2 Partition Bandwidth CUDA Programming and Performance	3	348	June 4, 2024
A100 & RTX3090 Memory Similarities and Differences CUDA Programming and Performance	7	1727	September 28, 2022

How to utilize L2 partition?

Related topics