shared memory banks

sashang · November 22, 2008, 5:43am

It’s not clear to me what the size of a shared memory bank is. The programming guide says

Does this mean that a shared memory bank is 32 bits? Don’t think so…

Also, what’s an example of code that causes shared memory bank conflicts?

sashang · November 22, 2008, 5:57am

OK, I did some further reading and I think I see how it works now. There are 16 memory banks of 1 KB each per multiprocessor. So when you access something in shared memory with a stride of 4 bytes, for example:

Assume s is pointing to a uint32_t array.

In thread 0:

s[0]

In thread 1:

s[1]

In thread 2:

s[2]

etc…

Then those individual accesses are mapped to the physical memory banks 0-15 on the hardware. So, for example, the absolute address of s[1] would be, roughly speaking, base_address_of_the_banks + sizeof(bank) + 4. Because a bank is 1 KB in size, s[1] points to the 4th byte in the 1st memory bank. Does that sound right?

alex_dubinsky · November 23, 2008, 12:59am

No, it points to the 1st byte of the 2nd memory bank.

sashang · November 23, 2008, 6:09am

Sorry, my initial post may not have been clear. Firstly, s is a pointer to a uint32_t array, so s[1] will point to the 4th byte of that bank. Also, I was counting memory banks from index 0 to 15, so I meant the bank at index 1, which is the 2nd bank as you point out.

alex_dubinsky · November 23, 2008, 8:09am

But why 4th byte?

sashang · November 23, 2008, 10:07am

Isn’t that how the array operator works? A uint32_t is 4 bytes.

E.g.:

uint32_t* p = …

p[0] ----> 0th byte
           1st byte
           2nd byte
           3rd byte
p[1] ----> 4th byte
p[2] ----> 8th byte
           etc.

If p were a pointer to a uint8_t, it would look like this:

p[0] ----> 0th byte
p[1] ----> 1st byte
p[2] ----> 2nd byte

alex_dubinsky · November 23, 2008, 6:36pm

s[0] points to byte 0 of bank 0
s[1] points to byte 0 of bank 1
s[2] points to byte 0 of bank 2

s[16] points to byte 4 of bank 0
s[17] points to byte 4 of bank 1
…

sashang · November 23, 2008, 9:02pm

Yes, my mistake.
