Hi community,
When I tried to print my device's maximum available shared memory, I found two properties in CUDA: deviceProp.sharedMemPerBlock and deviceProp.sharedMemPerMultiprocessor.
On an A800 GPU, deviceProp.sharedMemPerBlock is 49152 bytes and deviceProp.sharedMemPerMultiprocessor is 167936 bytes.
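For reference, this is a minimal sketch of how I printed those values (I also print sharedMemPerBlockOptin, a related field I noticed in cudaDeviceProp, in case it matters here):

```cpp
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    cudaDeviceProp prop;
    cudaError_t err = cudaGetDeviceProperties(&prop, 0);  // query device 0
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaGetDeviceProperties failed: %s\n",
                cudaGetErrorString(err));
        return 1;
    }
    // Per-block default limit vs. total per-SM capacity
    printf("sharedMemPerBlock:          %zu bytes\n", prop.sharedMemPerBlock);
    printf("sharedMemPerMultiprocessor: %zu bytes\n", prop.sharedMemPerMultiprocessor);
    // Opt-in per-block maximum (may be larger than the default limit)
    printf("sharedMemPerBlockOptin:     %zu bytes\n", prop.sharedMemPerBlockOptin);
    return 0;
}
```

On my A800 the first two lines print 49152 and 167936 as described above.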
I know that an SM can run multiple blocks. My question is: even if a kernel has only one block, is the maximum shared memory available to that block still 49152 bytes?
For example, on the A800 I have a kernel whose number of blocks equals the number of SMs, i.e. 108.
In this case each SM runs exactly one block, yet although each SM has 167936 bytes of shared memory, each block can apparently use at most 49152 bytes.
Is that true? Or will CUDA do some optimization when each SM runs only one block, so that the maximum shared memory available to each block can be greater than 49152 bytes?