Hello, I have a simple question about L2 cache partitioning in the Ampere architecture.
I looked through the slides for “Dissecting the Ampere GPU Architecture through Microbenchmarking”, which was presented at NVIDIA GTC 2021. As far as I understood, it is possible to make an SM load data from a specific partition of the L2 cache (the Ampere A100 has 2 L2 cache partitions).
Can anyone please tell me how to do this, or send me a link explaining how to do it?
Alternatively, please let me know if I misunderstood the contents of the presentation.
Thank you in advance!