Where do the extra 32 bytes of shared memory come from?

shong · July 20, 2009, 4:38am

Hi everyone,

Recently, while optimizing my program, I found that the shared memory size reported in the cubin file is larger than what I use in the kernel (by only 32 bytes). Where are those extra bytes being used?

[codebox]/************************************************************************/
/* calculate the maximum and minimum of a vector                        */
/************************************************************************/
// The template parameter list was stripped from the post. The mangled name in
// the cubin (ILi2E) indicates a single int parameter; the name N is a placeholder.
template <int N>
__global__ void CalMaxMinD1(int np, float* idata, float* odatamax, float* odatamin)
{
	int tid = threadIdx.x;
	int i = __mul24(blockIdx.x, blockDim.x) + threadIdx.x;
	int gridsize = __mul24(gridDim.x, blockDim.x);

	// per-thread running maximum and minimum: 2 * 256 * 4 bytes = 2048 bytes
	__shared__ float maxtemp[256];
	__shared__ float mintemp[256];

	// load
	maxtemp[tid] = idata[i];
	mintemp[tid] = maxtemp[tid];
	i += gridsize;

	// grid-stride loop over the rest of the input
	while (i < np)
	{
		float temp = idata[i];
		if (maxtemp[tid] < temp) { maxtemp[tid] = temp; }
		if (mintemp[tid] > temp) { mintemp[tid] = temp; }
		i += gridsize;
	}
	__syncthreads();

	// … do the reduction work …

	// output
	if (tid == 0)
	{
		odatamax[blockIdx.x] = maxtemp[0];
		odatamin[blockIdx.x] = mintemp[0];
	}
}[/codebox]

In this code, I use only two shared memory arrays, which total 256 * 4 bytes * 2 = 2048 bytes. However, the cubin file shows that I use 2080 bytes of shared memory. Could you please tell me why?

[codebox]code {
	name = _Z11CalMaxMinD1ILi2EEviPfS0_S0_
	lmem = 0
	smem = 2080
	reg  = 8
	bar  = 1
	const {
		segname = const
		segnum  = 1
		offset  = 0
		bytes   = 4
		mem {
			0x0000001f
		}
	}
	bincode {.......
	}
}[/codebox]

peter

seibert · July 20, 2009, 5:16am

CUDA uses some of the shared memory for storing kernel parameters and block/grid dimensions.
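
To make the arithmetic concrete, here is a rough sketch of how the 2080 bytes could break down. It rests on two assumptions that are not stated in the thread: a 32-bit build (4-byte pointers), and a compute capability 1.x layout in which, as far as I know, each block's shared memory begins with a 16-byte system area for the grid/block dimensions and block index, followed by the kernel arguments. The constant and function names are made up for illustration.

```cpp
#include <cstddef>

// Assumption: compute capability 1.x reserves a 16-byte system area at the
// start of shared memory (grid/block dimensions and block index).
constexpr std::size_t kSystemAreaBytes = 16;

// Assumption: 32-bit build, so each device pointer argument takes 4 bytes.
constexpr std::size_t kPointerBytes = 4;

// The two __shared__ float[256] arrays declared in the kernel.
constexpr std::size_t staticSharedBytes() {
    return 2 * 256 * sizeof(float);
}

// Kernel arguments: int np plus three float* pointers.
constexpr std::size_t kernelParamBytes() {
    return sizeof(int) + 3 * kPointerBytes;
}

// Total that would show up as "smem" in the cubin under these assumptions.
constexpr std::size_t reportedSmemBytes() {
    return staticSharedBytes() + kernelParamBytes() + kSystemAreaBytes;
}
```

Under these assumptions the arrays account for 2048 bytes, the arguments for 16 and the system area for 16, which matches the `smem = 2080` line in the cubin. On a 64-bit build the argument portion would be larger, so the overhead would not be exactly 32 bytes.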

shong · July 21, 2009, 8:55am

Thanks.

peter
