Shared memory banks and Warp size: New Warp Size in the Future?

cuco · January 22, 2008, 5:56pm

As I understand shared memory bank conflicts, each thread in a half warp should access a different bank. There are 16 memory banks and 32 threads in a Warp, so a half warp is 16 threads.

The part that is bothering me is what happens in the future if the Warp size changes.

1 - Is it possible for the Warp size to go larger or smaller for future processors?

2 - If smaller, say a new Warp size of 16, will the conflict-free rule then apply to threads within a half warp (8 threads in this case), or will it still apply to groups of 16 threads?
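The half-warp rule described above can be modeled in a few lines. This is only a sketch under stated assumptions: 16 banks, 32-bit words assigned to banks round-robin, and 16 threads serviced together, as on the hardware discussed in this thread. The names NUM_BANKS, HALF_WARP, bankOfWord and conflictDegree are made up for illustration and are not part of CUDA.

```cuda
// Assumed values for the hardware discussed in this thread (G80 / Tesla C870).
#define NUM_BANKS 16   // shared memory banks (assumption, not queried)
#define HALF_WARP 16   // threads whose shared memory requests are serviced together

// Bank that holds a given 32-bit word of shared memory,
// assuming successive words map to successive banks.
__host__ __device__ inline int bankOfWord(int wordIndex) {
    return wordIndex % NUM_BANKS;
}

// Largest number of threads in one half warp that land on the same bank
// when thread t accesses word t * stride (stride >= 0).
// A result of 1 means conflict-free; a result of n means an n-way conflict.
__host__ __device__ inline int conflictDegree(int stride) {
    if (stride == 0) return 1;  // every thread reads the same word: broadcast
    int hits[NUM_BANKS] = {0};
    int worst = 0;
    for (int t = 0; t < HALF_WARP; ++t) {
        int b = bankOfWord(t * stride);
        if (++hits[b] > worst) worst = hits[b];
    }
    return worst;
}
```

Under this model a stride of 1 or any odd stride is conflict-free, a stride of 2 gives a 2-way conflict, and a stride of 16 serializes all 16 threads on one bank.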

DenisR · January 22, 2008, 6:03pm

I remember reading that shared memory might move to 32 banks, so you should avoid conflicts across a full warp.

The best approach would be to #define WARP_SIZE and BANK_SIZE (the number of banks) and write your code in terms of these defines, so that both the conflict avoidance and the tuning can follow the hardware.
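As a rough illustration of the #define suggestion, here is a minimal sketch of a tiled transpose whose shared memory padding is derived from a single define. It assumes 16 banks; NUM_BANKS (standing in for the BANK_SIZE mentioned above), TILE_DIM and transposeTile are hypothetical names. The warp size needs no define inside device code, since CUDA exposes it there as the built-in variable warpSize.

```cuda
// Assumed number of shared memory banks; change this in one place for other hardware.
#define NUM_BANKS 16
// One tile row per bank cycle, so a row of threads covers every bank once.
#define TILE_DIM NUM_BANKS

// Transposes a height x width row-major matrix into a width x height one.
// Launch with a TILE_DIM x TILE_DIM thread block.
__global__ void transposeTile(const float* in, float* out, int width, int height) {
    // The extra column makes the row pitch NUM_BANKS + 1 words, so threads
    // walking down a column of the tile fall into different banks.
    __shared__ float tile[TILE_DIM][TILE_DIM + 1];

    int x = blockIdx.x * TILE_DIM + threadIdx.x;  // input column
    int y = blockIdx.y * TILE_DIM + threadIdx.y;  // input row
    if (x < width && y < height)
        tile[threadIdx.y][threadIdx.x] = in[y * width + x];

    __syncthreads();

    x = blockIdx.y * TILE_DIM + threadIdx.x;      // output column (input row)
    y = blockIdx.x * TILE_DIM + threadIdx.y;      // output row (input column)
    if (x < height && y < width)
        out[y * height + x] = tile[threadIdx.x][threadIdx.y];  // column read
}
```

A launch would look like transposeTile<<<grid, block>>>(dIn, dOut, width, height) with dim3 block(TILE_DIM, TILE_DIM) and a grid that covers the matrix. Without the padding column, the column read would have a stride of NUM_BANKS words and every thread in the row would hit the same bank.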

Mark_Harris · January 22, 2008, 10:01pm

In the future the warp size may change, as may the number of banks. Unfortunately that’s all I can say at this time.

Mark

cuco · January 22, 2008, 11:08pm

Mark and Denis, thanks for the responses.

Mark, if and when you change things like the Warp size or the number of banks in the GPU, are you also planning to change the device version number, e.g. to 1.2?

That is, how would you recommend that developers design code that performs well on G80 and Tesla but can also scale to future Gxx processors? Would you recommend reading the description of the installed card from the properties provided by CUDA, the Warp size and other fields in the properties structure, or the version of the CUDA runtime?

mfatica · January 22, 2008, 11:15pm

Use cudaGetDeviceProperties (as in the deviceQuery example):

Device 0: “Tesla C870”
Major revision number: 1
Minor revision number: 0
Total amount of global memory: 1610350592 bytes
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 16384 bytes
Total number of registers available per block: 8192
Warp size: 32
Maximum number of threads per block: 512
Maximum sizes of each dimension of a block: 512 x 512 x 64
Maximum sizes of each dimension of a grid: 65535 x 65535 x 1
Maximum memory pitch: 262144 bytes
Texture alignment: 256 bytes
Clock rate: 1350000 kilohertz
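For reference, a minimal sketch of the kind of query that produces output like the listing above. It uses only cudaGetDeviceProperties and fields of cudaDeviceProp from the CUDA runtime; printWarpInfo is a hypothetical helper name, and the output format is not the one used by the deviceQuery sample.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Prints a few properties of one device and returns its warp size,
// or -1 if the query fails (for example, when no device is present).
int printWarpInfo(int device) {
    cudaDeviceProp prop;
    cudaError_t err = cudaGetDeviceProperties(&prop, device);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaGetDeviceProperties failed: %s\n",
                cudaGetErrorString(err));
        return -1;
    }
    printf("Device %d: \"%s\"\n", device, prop.name);
    printf("  Compute capability:      %d.%d\n", prop.major, prop.minor);
    printf("  Warp size:               %d\n", prop.warpSize);
    printf("  Shared memory per block: %zu bytes\n", prop.sharedMemPerBlock);
    printf("  Max threads per block:   %d\n", prop.maxThreadsPerBlock);
    return prop.warpSize;
}
```

Host code can call printWarpInfo(0) once at startup and use the returned warp size when choosing block dimensions, rather than hard-coding 32.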

Mark_Harris · January 22, 2008, 11:44pm

Unfortunately that doesn’t (currently) tell you the number of banks.

I’m pretty confident that any change in the bank configuration will correspond to a change in compute capability / SM version.

Mark
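Since the properties structure does not report the bank count, one way to act on the point above is to derive it from the compute capability. The sketch below is an editorial illustration, not something the posters had available: it assumes 16 banks for compute capability 1.x and 32 banks for 2.x and later, which is what NVIDIA's programming guide documents for hardware released after this thread. The helper names assumedBankCount and bankCountForDevice are hypothetical.

```cuda
#include <cuda_runtime.h>

// Assumed mapping from compute capability major version to the number of
// shared memory banks: 16 for 1.x, 32 for 2.x and later. Any architecture
// not covered by the documentation would need this table revisited.
__host__ __device__ inline int assumedBankCount(int ccMajor) {
    return (ccMajor >= 2) ? 32 : 16;
}

// Looks up the compute capability of a device and returns the assumed
// bank count, or -1 if the device cannot be queried.
inline int bankCountForDevice(int device) {
    cudaDeviceProp prop;
    if (cudaGetDeviceProperties(&prop, device) != cudaSuccess)
        return -1;
    return assumedBankCount(prop.major);
}
```

Keeping the mapping in one function means that padding and tile sizes computed on the host only need a single change if a later architecture uses a different bank configuration.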
