Shared memory per block

kidanime3d · March 27, 2012, 1:36pm

Apologises for the noob question. Recently ran the deviceQuery program in the SDK, and it states there is 48KB available for block, is it referring to the SM? As this seems to conflict with Compute 2.1

Device 0: "GeForce GT 530"

  CUDA Driver Version / Runtime Version          4.1 / 4.1

  CUDA Capability Major/Minor version number:    2.1

  Total amount of global memory:                 1024 MBytes (1073283072 bytes)

  ( 2) Multiprocessors x (48) CUDA Cores/MP:     96 CUDA Cores

  GPU Clock Speed:                               1.40 GHz

  Memory Clock rate:                             793.00 Mhz

  Memory Bus Width:                              128-bit

  L2 Cache Size:                                 131072 bytes

  Max Texture Dimension Size (x,y,z)             1D=(65536), 2D=(65536,65535), 3D=(2048,2048,2048)

  Max Layered Texture Size (dim) x layers        1D=(16384) x 2048, 2D=(16384,16384) x 2048

  Total amount of constant memory:               65536 bytes

  Total amount of shared memory per block:       49152 bytes

  Total number of registers available per block: 32768

  Warp size:                                     32

  Maximum number of threads per block:           1024

  Maximum sizes of each dimension of a block:    1024 x 1024 x 64

  Maximum sizes of each dimension of a grid:     65535 x 65535 x 65535

  Maximum memory pitch:                          2147483647 bytes

  Texture alignment:                             512 bytes

seibert · March 29, 2012, 8:27pm

The 48 kB of shared memory per block is the default configuration for compute capability 2.x. You can change it at runtime to be 16 kB (to give 48 kB to the L1 cache).

Topic		Replies	Views
Question about max shared memory in block and multiprocessor CUDA Programming and Performance	2	604	February 20, 2024
Shared memory CUDA Programming and Performance	2	6853	April 14, 2011
CUDA Device Query says P100 has 48kb shared memory/block... but it's supposed to be 64kb? CUDA Programming and Performance	3	1085	June 4, 2017
Understanding deviceQuery CUDA Programming and Performance	2	4061	June 28, 2014
shared memory vs local memory CUDA Programming and Performance	1	8057	December 12, 2011
What is the default Shared Memory size per block in GTX 480 ? CUDA Programming and Performance	6	3325	March 21, 2012
maximum shared memory size CUDA Programming and Performance	3	7101	June 2, 2015
number of gpu's CUDA Programming and Performance	1	1384	May 12, 2009
Beginner questions on memory spaces CUDA Programming and Performance	2	2506	February 3, 2011
device organization CUDA Programming and Performance	1	4162	April 6, 2008

Shared memory per block

Related topics