Hello,
there are examples in the docs that show that I can also declare dynamically allocated shared memory at global (file) scope so that all CUDA kernels can access it:
__shared__ float Data1[];
The size of the shared memory is derived from the kernel's execution configuration. But what happens if I have two such declarations?
__shared__ float Data1[];
__shared__ float Data2[];
The compiler, of course, does not complain. But what is the behavior here?
Also, what happens if I declare shared memory globally and two kernels are executed in parallel with different shared memory configurations?
Thanks for clarifying this!
Martin
You may want to read the documentation:
[url]http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#shared[/url]
Dynamically allocated shared memory must be declared with the extern keyword, so your examples are not syntactically correct.
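For reference, a minimal sketch of the correct form, using illustrative names (sdata, scaleKernel, threadsPerBlock are not from the thread):

// Dynamically allocated shared memory: extern and unsized; the actual size
// comes from the third parameter of the execution configuration.
extern __shared__ float sdata[];

__global__ void scaleKernel(float *out, const float *in, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        sdata[threadIdx.x] = in[i];          // stage the value in per-block shared memory
        out[i] = 2.0f * sdata[threadIdx.x];  // read it back
    }
}

// Host side (illustrative launch):
// scaleKernel<<<blocks, threadsPerBlock, threadsPerBlock * sizeof(float)>>>(d_out, d_in, n);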
In answer to your question about having two (or more) such definitions, the documentation states:
“All variables declared in this fashion, start at the same address in memory,”
so the behavior is that Data1 and Data2 will point to the same location.
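A minimal sketch of what that aliasing means, plus the offset technique the programming guide recommends for carving several arrays out of the single dynamic allocation (names are illustrative):

extern __shared__ float Data1[];
extern __shared__ float Data2[];

__global__ void aliasDemo()   // launched with at least sizeof(float) bytes of dynamic shared memory
{
    // Data1 and Data2 both start at the base of the dynamic shared allocation,
    // so a write through one is visible through the other.
    if (threadIdx.x == 0) {
        Data1[0] = 1.0f;     // Data2[0] now also reads 1.0f
    }
}

// To get two logically separate arrays, manage the layout with explicit offsets:
__global__ void twoArrays(int n)   // launched with 2 * n * sizeof(float) bytes
{
    extern __shared__ float smem[];
    float *a = smem;       // first n floats
    float *b = smem + n;   // next n floats
    if (threadIdx.x < n) {
        a[threadIdx.x] = 1.0f;
        b[threadIdx.x] = 2.0f;
    }
}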
Shared memory cannot be a global definition. It can only have scope within a specific kernel definition. If you define a shared memory location at global scope, it is as if you wrote that definition in each kernel in the compilation unit. The behavior would therefore be sorted out according to the rules already given in the programming guide.
Hi txbob, thanks for the info
You are right, and I have already read it. But it is often not obvious from the docs how things really work or how the text should be interpreted. Sometimes it helps to get a push in the right direction.
Did you also consider the question of what happens if two kernels are executed in parallel with different shared memory configurations?
Thanks
Martin
Yes, and I responded in my posting starting with:
“Shared memory cannot be a global definition…”
Two kernels running in parallel can have different shared memory configurations. __shared__ memory is by definition local to a given kernel definition, so there is no connection between the shared memory of different kernels.
If kernel A requires 4 KB per threadblock and kernel B requires 6 KB per threadblock, there is no conflict or confusion that I can see. Your original statement in this regard mentioned “global” shared memory, which is an invalid concept.
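A minimal sketch of that situation, with illustrative kernel names and sizes: each launch gets its own per-block dynamic shared memory allocation, sized by its own execution configuration, even if the two kernels overlap on different streams.

#include <cuda_runtime.h>

// File-scope declaration: behaves as if it were written inside each kernel.
extern __shared__ float smem[];

__global__ void kernelA()        // launched with 4 KB per block
{
    smem[threadIdx.x] = (float)threadIdx.x;
}

__global__ void kernelB()        // launched with 6 KB per block
{
    smem[threadIdx.x] = 2.0f * (float)threadIdx.x;
}

int main()
{
    cudaStream_t s1, s2;
    cudaStreamCreate(&s1);
    cudaStreamCreate(&s2);

    // The dynamic shared memory size is specified per launch;
    // the two configurations are completely independent.
    kernelA<<<10, 256, 4 * 1024, s1>>>();
    kernelB<<<10, 256, 6 * 1024, s2>>>();

    cudaDeviceSynchronize();
    cudaStreamDestroy(s1);
    cudaStreamDestroy(s2);
    return 0;
}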