Shared memory problem of above 48 KB requires dynamic shared memory?

Julier · May 10, 2021, 9:50am

Hi everybody!
There have a question about shared memory. My GPU is NVIDIA Geforce RTX 2080 with compute capability 7.5, accroding to the document of CUDA C Programing Guide.pdf, I know Maxium amount of shared memory per thread block is 64 KB and there have a tip that above 48kb requires dynamic shared memory. But I can’t use shared memory between 48kb and 64kb by the way of dynamic shared memory or static shared memory. I don’t know why?
Thank you for your answer!

Julier · May 10, 2021, 10:10am

Blow is my code for testing the dynamic shared memory, but it’s not working!
tip: 90*90*sizeof(double) is approaching 64KB, when modify it to 78*78*sizeof(double), approaching 48 KB, it start working.

static __global__
void Gauss_Jordan_Inverse(double* mat_tmp, int n) {

	int idx = threadIdx.x;
	int idy = blockIdx.x;

	
	double * mat_tmp1 = mat_tmp + idy * n * n;
	extern __shared__ double mat[];
	Matrix_copy_glob2shr(mat_tmp + idx*n , mat + idx * n, n);


	for (int i = 0; i < n; i++) {
		mat[idx * n + i] += 1;
	}

	__syncthreads();
	
	Matrix_copy_shr2glob(mat+idx*n, mat_tmp+idx*n, n);
}


static void fun(double* out ,int size){
    //do something...
    Gauss_Jordan_Inverse << <1, 1,90*90*sizeof(double)>> > (d_out, size);
   //do something...
}

cbuchner1 · May 10, 2021, 11:43am

You’re missing this
See here: Shared memory size per Thread Block

// Host code
int maxbytes = 65536; // 64 KB
cudaFuncSetAttribute(Gauss_Jordan_Inverse, cudaFuncAttributeMaxDynamicSharedMemorySize, maxbytes);

I’d kindly advise to use the forum’s search function before posting.

Julier · May 11, 2021, 8:56am

Thank you very much for your answer and advise. it starts work.

1596604120 · June 17, 2021, 11:05am

It’s very good

Topic		Replies	Views
Max shared memory CUDA Programming and Performance	0	1330	July 28, 2020
Dynamic shared memory can not allocate problem(on 3050PC) CUDA Programming and Performance	2	494	July 20, 2022
Dynamic shared memory calculated by ncu larger than Max_shared_memory_per_block Nsight Compute cuda	4	774	September 21, 2023
How to allocate more than 48KB shared memory on A100? CUDA Programming and Performance	3	1152	April 29, 2023
Shared memory to Global memory data transfer nvc, nvc++ and nvfortran cuda	5	669	June 26, 2023
Shared memory size per Thread Block CUDA Programming and Performance	2	6933	May 17, 2019
Strange ptxas error in shared memory CUDA Programming and Performance	7	9246	February 24, 2009
regarding Shared mem CUDA Programming and Performance	1	3722	February 1, 2009
Size limit on dynamic allocated shared memory CUDA Programming and Performance	2	1525	November 6, 2008
Static allocation successfully for more than 48KB shared memory? CUDA Programming and Performance cuda , kernel	10	143	November 5, 2025

Shared memory problem of above 48 KB requires dynamic shared memory?

Related topics