Shared memory and multiple blocks

fender177 · March 16, 2011, 1:09am

Hi Everyone,

When I use shared memory within a kernel, is the shared memory variable created for each block of threads?

For example:

__global__ void SOMEKERNEL(double *a, double *b)

{

     __shared__ double c[512];

}

Does each block get c[512]? So if I had 10 blocks, would there be 10 copies of c[512]? Each of which I could load with data from global memory…? I’m trying to implement the reduction code from the SDK in one of my kernels, but my program is crashing and I want to make sure I’ve understood how using shared memory works.

Thanks in advance to any help!

tera · March 16, 2011, 1:47am

Yes, each running block gets its own set of shared memory variables.

fender177 · March 16, 2011, 12:02pm

Great, thanks!

Topic		Replies	Views
Shared memory and blocks CUDA Programming and Performance	2	3621	March 13, 2008
Use shared Memory CUDA Programming and Performance	3	430	December 26, 2019
Shared memory access of many threads CUDA Programming and Performance	2	2817	December 4, 2008
shared memory CUDA Programming and Performance	3	1539	June 14, 2011
Shared Memory allocation.. CUDA Programming and Performance	5	5349	July 9, 2010
Can I define more than one variable in shared memory? CUDA Programming and Performance	2	337	June 6, 2022
shared memory allocation among thread blocks CUDA Programming and Performance	3	1842	March 3, 2008
Shared Memory variables ? In multiple kernel invocations CUDA Programming and Performance	2	1877	July 11, 2008
shared memory and syncthreads question CUDA Programming and Performance	2	1211	March 3, 2009
How shared are shared variables? Can shared variables from separate function calls conflict? CUDA Programming and Performance	3	2502	July 17, 2011

Shared memory and multiple blocks

Related topics