Shared memory variables in multiple kernel invocations?

kartik14 · July 11, 2008, 2:07am

Hi,

I have to call a kernel several times in my program.

Suppose I have a shared memory array of size 1000x1000. Can I fill in just a few elements in each invocation?

Will the data filled in during one invocation still be available in the next invocation? I know that this is true for a global memory array.

Can anyone tell me how to do this, because my performance with global memory is not too good…

Thanks

Sibi_A · July 11, 2008, 5:34am

You can’t have a shared memory array of size 1000x1000. Even at one byte per element that would need about 1 MB.
Shared memory is limited to 16KB per block. (However, you can operate on a 1000x1000 global memory array using shared memory :smile: )

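Shared memory has block scope only, which means data written into shared memory by one block is not available to the next block. Its lifetime also ends with the block, so nothing stored there survives into the next kernel invocation. Data that must persist between invocations has to live in global memory.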
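
To make the point about scope concrete, here is a minimal sketch, assuming a simple "add 1 per launch" workload that is not from the original question. The kernel name `addOne` and the block size `TILE` are hypothetical, chosen only for illustration. The array lives in global memory and persists across launches. Shared memory is used only as per-block scratch space and is refilled on every launch.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

#define TILE 256  // hypothetical block size; 256 floats = 1 KB of shared memory

// Each launch adds 1 to every element of a global array.
// The shared array is scratch space only: its contents are undefined
// when a block starts, so it is reloaded from global memory every time.
__global__ void addOne(float *data, int n)
{
    __shared__ float tile[TILE];
    int i = blockIdx.x * blockDim.x + threadIdx.x;

    if (i < n) tile[threadIdx.x] = data[i];   // global -> shared
    __syncthreads();

    if (i < n) {
        tile[threadIdx.x] += 1.0f;            // work on the shared copy
        data[i] = tile[threadIdx.x];          // shared -> global, so the result persists
    }
}

int main()
{
    const int n = 1000 * 1000;                // the 1000x1000 array, kept in global memory
    float *d_data = NULL;
    cudaMalloc((void **)&d_data, n * sizeof(float));
    cudaMemset(d_data, 0, n * sizeof(float)); // all-zero bytes are 0.0f

    int blocks = (n + TILE - 1) / TILE;
    for (int launch = 0; launch < 3; ++launch)
        addOne<<<blocks, TILE>>>(d_data, n);  // global data carries over between launches

    float first = 0.0f;
    cudaMemcpy(&first, d_data, sizeof(float), cudaMemcpyDeviceToHost);
    printf("data[0] after 3 launches: %f\n", first);

    cudaFree(d_data);
    return 0;
}
```

Error checking is omitted to keep the sketch short. In real code, each CUDA call should have its return value checked.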

If you are using or learning shared memory for the first time, I recommend reading the following presentation by Mark Harris.

http://www.gpgpu.org/sc2007/SC07_CUDA_5_Optimization_Harris.pdf

Shared memory can be used for the following purposes:

To achieve coalesced global memory access.
To minimize repeated reads of the same data from global memory.
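
Both purposes can be seen in one small example. The following is a minimal sketch, assuming a 3-point averaging filter as the workload, which is not from the original thread. The names `blur3`, `BLOCK`, and `RADIUS` are hypothetical. Each block loads its slice of the input into shared memory with one coalesced read per thread, and then every thread reads its neighbours from shared memory instead of fetching them from global memory again. The kernel assumes it is launched with exactly `BLOCK` threads per block.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

#define BLOCK  256  // hypothetical block size; the launch must use this many threads
#define RADIUS 1    // one neighbour on each side

__global__ void blur3(const float *in, float *out, int n)
{
    __shared__ float tile[BLOCK + 2 * RADIUS];
    int g = blockIdx.x * blockDim.x + threadIdx.x;  // global index
    int l = threadIdx.x + RADIUS;                   // index inside the tile

    // Consecutive threads read consecutive addresses: a coalesced load.
    tile[l] = (g < n) ? in[g] : 0.0f;

    // The first RADIUS threads also fetch the halo cells on both sides.
    if (threadIdx.x < RADIUS) {
        int left  = g - RADIUS;
        int right = g + BLOCK;
        tile[l - RADIUS] = (left >= 0) ? in[left]  : 0.0f;
        tile[l + BLOCK]  = (right < n) ? in[right] : 0.0f;
    }
    __syncthreads();  // the whole tile must be loaded before anyone reads it

    // Each input value is now reused by up to three threads from shared memory.
    if (g < n)
        out[g] = (tile[l - 1] + tile[l] + tile[l + 1]) / 3.0f;
}

int main()
{
    const int n = 1000 * 1000;
    std::vector<float> h_in(n, 1.0f), h_out(n, 0.0f);

    float *d_in = NULL, *d_out = NULL;
    cudaMalloc((void **)&d_in,  n * sizeof(float));
    cudaMalloc((void **)&d_out, n * sizeof(float));
    cudaMemcpy(d_in, &h_in[0], n * sizeof(float), cudaMemcpyHostToDevice);

    int blocks = (n + BLOCK - 1) / BLOCK;
    blur3<<<blocks, BLOCK>>>(d_in, d_out, n);

    cudaMemcpy(&h_out[0], d_out, n * sizeof(float), cudaMemcpyDeviceToHost);
    printf("out[%d] = %f\n", n / 2, h_out[n / 2]);

    cudaFree(d_in);
    cudaFree(d_out);
    return 0;
}
```

The tile here uses 258 floats, roughly 1 KB, so it fits comfortably within the 16KB per-block limit mentioned above.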

See fig 1.5 in NVIDIA_CUDA_Programming_Guide_1.1.pdf
(I think this fig is not available in 2.0beta2 programming guide)

kartik14 · July 11, 2008, 6:48am

Thanks for the tip.
