Weird thing in Shared Memory: Looking for an explanation

sabkalyan · January 28, 2011, 6:31am

Hi,

I am rather new to CUDA … so kindly excuse me if I am asking a rudimentary question.

I understand that any variable declared with the __shared__ qualifier resides in shared memory (SMEM) and is accessible by all threads within a block.
Consider a very simple kernel that transfers the contents of an array b into an array c through SMEM, as follows:

(Here let's say we have 10 threads in a block, and only one block.)

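__global__ void copy_array(int *b, int *c)   // kernel name is illustrative
{
    __shared__ int i[10];              // one slot per thread in the block

    i[threadIdx.x] = b[threadIdx.x];   // each thread writes its own slot
    c[threadIdx.x] = i[threadIdx.x];   // and reads back only that slot
}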

Now if I transfer the variable c to the host and print it, it must have the contents of b in it, which is perfectly fine.
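
For anyone who wants to reproduce this, here is a minimal, self-contained sketch of the array version together with host code. The names copy_via_shared_array and run_array_copy, the test values, and the host-side setup are assumptions made for illustration; only the kernel body follows the post.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

#define N 10  // one block of 10 threads, as in the example above

// Each thread stages its own element in shared memory, then copies it out.
__global__ void copy_via_shared_array(const int *b, int *c)
{
    __shared__ int s[N];
    s[threadIdx.x] = b[threadIdx.x];
    c[threadIdx.x] = s[threadIdx.x];  // each thread reads only the slot it wrote
}

// Hypothetical host helper: copies h_b to the device, runs the kernel,
// and copies the result back into h_c. Returns false on any CUDA error.
bool run_array_copy(const int *h_b, int *h_c)
{
    int *d_b = NULL, *d_c = NULL;
    if (cudaMalloc(&d_b, N * sizeof(int)) != cudaSuccess) return false;
    if (cudaMalloc(&d_c, N * sizeof(int)) != cudaSuccess) { cudaFree(d_b); return false; }

    bool ok = cudaMemcpy(d_b, h_b, N * sizeof(int), cudaMemcpyHostToDevice) == cudaSuccess;
    copy_via_shared_array<<<1, N>>>(d_b, d_c);
    ok = ok && cudaDeviceSynchronize() == cudaSuccess;
    ok = ok && cudaMemcpy(h_c, d_c, N * sizeof(int), cudaMemcpyDeviceToHost) == cudaSuccess;

    cudaFree(d_b);
    cudaFree(d_c);
    return ok;
}

int main()
{
    int h_b[N], h_c[N];
    for (int k = 0; k < N; ++k) { h_b[k] = 100 + k; h_c[k] = -1; }

    if (!run_array_copy(h_b, h_c)) { printf("CUDA error\n"); return 1; }

    for (int k = 0; k < N; ++k) printf("b[%d] = %d, c[%d] = %d\n", k, h_b[k], k, h_c[k]);
    return 0;
}
```

No barrier is needed in this version because no thread ever reads a shared-memory slot that another thread wrote.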

Now…

my program looks something like this:

__global__ void copy_scalar(int *b, int *c)   // kernel name is illustrative
{
    __shared__ int i;        // a single slot shared by all threads in the block

    i = b[threadIdx.x];      // every thread writes to the same address
    c[threadIdx.x] = i;
}

Here, instead of an array, I am declaring a single variable i in SMEM. The thing is, I am getting the same result as above (i.e. the contents of b perfectly copied into c), which I don't think should happen: i resides in SMEM, all threads can access it, and all threads write into it simultaneously! Ideally I would expect some garbage values in c.

Is there a gap in my understanding of SMEM? Kindly help…

Thanks in advance!

avidday · January 28, 2011, 6:44am

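Probably just compiler optimization. In the second example, the variable i can be removed and the code simplified, because i isn't used anywhere else in the code. Each thread then simply copies b[threadIdx.x] straight to c[threadIdx.x] without ever going through shared memory.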
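
One way to check this is to put a barrier between the write and the read, so the store to shared memory and the load from it both have to happen. The sketch below does that. The names copy_via_shared_scalar and run_scalar_copy and the host code are assumptions for illustration, and the explanation itself remains a guess, since the generated PTX for the original kernel was not inspected.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

#define N 10  // one block of 10 threads, as in the question

__global__ void copy_via_shared_scalar(const int *b, int *c)
{
    __shared__ int s;      // one slot shared by every thread in the block
    s = b[threadIdx.x];    // all threads write the same address: a race
    __syncthreads();       // all writes complete before any thread reads
    c[threadIdx.x] = s;    // every thread reads the single surviving value
}

// Hypothetical host helper: runs the kernel on h_b and fills h_c.
// Returns false on any CUDA error.
bool run_scalar_copy(const int *h_b, int *h_c)
{
    int *d_b = NULL, *d_c = NULL;
    if (cudaMalloc(&d_b, N * sizeof(int)) != cudaSuccess) return false;
    if (cudaMalloc(&d_c, N * sizeof(int)) != cudaSuccess) { cudaFree(d_b); return false; }

    bool ok = cudaMemcpy(d_b, h_b, N * sizeof(int), cudaMemcpyHostToDevice) == cudaSuccess;
    copy_via_shared_scalar<<<1, N>>>(d_b, d_c);
    ok = ok && cudaDeviceSynchronize() == cudaSuccess;
    ok = ok && cudaMemcpy(h_c, d_c, N * sizeof(int), cudaMemcpyDeviceToHost) == cudaSuccess;

    cudaFree(d_b);
    cudaFree(d_c);
    return ok;
}

int main()
{
    int h_b[N], h_c[N];
    for (int k = 0; k < N; ++k) { h_b[k] = 100 + k; h_c[k] = -1; }

    if (!run_scalar_copy(h_b, h_c)) { printf("CUDA error\n"); return 1; }

    for (int k = 0; k < N; ++k) printf("c[%d] = %d\n", k, h_c[k]);
    return 0;
}
```

With the barrier in place, every element of c should hold the same value, namely whichever element of b was written last. Which thread's write survives is undefined, so the value can differ between devices. It will not be garbage, though: conflicting writes to one shared location leave one of the written values behind, not a mixture.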
