[SOLVED] Shared memory variable declaration

aLbErT_h · December 23, 2016, 3:08pm

Hi,

I’m trying to declare two shared memory arrays inside a CUDA kernel. The array sizes are dynamic, so I’m using extern variables whose size is determined at runtime.

__global__ void myKernel()
{
    extern __shared__ int localSum1[];
    extern __shared__ int localSum2[];
    ...
    int i_local = threadIdx.x + blockDim.x*threadIdx.y + blockDim.x*blockDim.y*threadIdx.z; 
    localSum1[i_local] = 0;
    localSum2[i_local] = 0;
    localSum1[i_local] += 1;
    localSum2[i_local] += 2;
    ...    
}

int main()
{
    ...
    myKernel<<<gridSize, blockSize, localSize>>>(); // localSize is the size of the arrays
    ...
}

If I compare the values from the two arrays, they are the same, but they should be different. I’m not sure whether I’m using the kernel launch configuration correctly. I know it works fine with one extern shared array declared inside the kernel, but now I have two arrays inside the kernel, and I think both arrays end up sharing the same memory space.

Maybe I should configure the kernel with two localSize values? Something like this:

int main()
{
    ...
    myKernel<<<gridSize, blockSize, localSize1, localSize2>>>();
    ...
}

Thank you.

Robert_Crovella · December 23, 2016, 3:13pm

This is covered in the programming guide:

http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#shared

When using dynamically allocated shared memory, only one pointer to the allocated space will be given to the kernel code. If you want to divide up that space, you must do so yourself. Please read the above linked programming guide section for an example.
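
As a rough illustration of dividing up that space, here is a minimal sketch, not the programming guide's own example. It assumes both arrays hold int and have one element per thread in the block. The names twoArraysKernel, sharedBuf, out1 and out2 are hypothetical and were added only so the host can check the result. Note that the third launch parameter is a size in bytes, so it has to cover both arrays.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical kernel: out1/out2 exist only so the host can inspect the result.
__global__ void twoArraysKernel(int *out1, int *out2)
{
    // A single extern declaration: the base of the whole dynamic allocation.
    extern __shared__ int sharedBuf[];

    // Assumption: each array has one int per thread in the block.
    int nThreads = blockDim.x * blockDim.y * blockDim.z;
    int *localSum1 = sharedBuf;             // first nThreads ints
    int *localSum2 = sharedBuf + nThreads;  // next nThreads ints

    int i_local = threadIdx.x + blockDim.x * threadIdx.y
                + blockDim.x * blockDim.y * threadIdx.z;

    localSum1[i_local] = 0;
    localSum2[i_local] = 0;
    localSum1[i_local] += 1;
    localSum2[i_local] += 2;

    // Each thread only touches its own slots, so no __syncthreads() is needed here.
    out1[i_local] = localSum1[i_local];
    out2[i_local] = localSum2[i_local];
}

int main()
{
    const dim3 gridSize(1);          // single block, so i_local indexes the output directly
    const dim3 blockSize(8, 4, 2);
    const int n = blockSize.x * blockSize.y * blockSize.z;
    const size_t localSize = 2 * n * sizeof(int);  // bytes for BOTH arrays

    int *d_out1 = nullptr, *d_out2 = nullptr;
    if (cudaMalloc(&d_out1, n * sizeof(int)) != cudaSuccess ||
        cudaMalloc(&d_out2, n * sizeof(int)) != cudaSuccess) {
        fprintf(stderr, "cudaMalloc failed\n");
        return 1;
    }

    twoArraysKernel<<<gridSize, blockSize, localSize>>>(d_out1, d_out2);
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        fprintf(stderr, "kernel failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    std::vector<int> h1(n), h2(n);
    cudaMemcpy(h1.data(), d_out1, n * sizeof(int), cudaMemcpyDeviceToHost);
    cudaMemcpy(h2.data(), d_out2, n * sizeof(int), cudaMemcpyDeviceToHost);
    cudaFree(d_out1);
    cudaFree(d_out2);

    // With separate regions, every localSum1 entry is 1 and every localSum2 entry is 2.
    for (int i = 0; i < n; ++i) {
        if (h1[i] != 1 || h2[i] != 2) {
            fprintf(stderr, "mismatch at %d: %d %d\n", i, h1[i], h2[i]);
            return 1;
        }
    }
    printf("arrays hold distinct values\n");
    return 0;
}
```

If the two arrays had different element types, the offset of the second one would also have to respect that type's alignment, which the linked section discusses.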

aLbErT_h · December 23, 2016, 3:22pm

Sorry,

I had read section C.3.1.6.3 of the programming guide, but I hadn’t read section B.2.3:

https://docs.nvidia.com/cuda/cuda-c-programming-guide/#shared-memory-variable-declarations

Thank you.

Robert_Crovella · December 23, 2016, 3:32pm

Section C discusses concepts specific to CUDA Dynamic Parallelism. It does not present a general treatment of the topic.
