I know that thread block clustering for accessing shared memory of different thread blocks has been a concept introduced in the Hopper architecture.
I wonder if thread block clustering is not supported in the Ada architecture that is parallel to Hopper. To be specific, the AD102 GPU.