Problems Understanding Bank Conflicts

_Brian · September 15, 2009, 10:53am

Hi,

i have some Problems understanding Bank Conflicts in shared memory.

In particular its the reduction example, which illustrates bank conflicts.

In the reduce1 example from the sdk:

[codebox]// do reduction in shared mem

for(unsigned int s=1; s < blockDim.x; s *= 2) 

{

    int index = 2 * s * tid;

if (index < blockDim.x)

    {

        sdata[index] += sdata[index + s];

    }

    __syncthreads();

}

[/codebox]

At the first Iteration each Thread access 2 successive Elements from shared memory.

In the Programming Guide its stated that: “any memory read or write request made of n addresses that fall in n distinct memory banks”

For my understanding thats the case in the example.

Thread 0 accesses Bank 0 and 1,

Thread 1 accesses Bank 2 and 3,

…

So there should not be any Bank Conflict in my opinion.

Can somebody clarify things a bit for me?

thanks,

Brian

_Brian · September 16, 2009, 7:00pm

I think i found the answer. If somebody else is interested:

Thread 0 till Thread 7 cause no Bank Conflict, but Thread 8 for example accesses Bank 0 like Thread 0 → Bank Conflict.

Its illustrated in the sdk programming guide 2.3 as well :">. Page 95, Figure 5-7, left: linear addressing with a stride of 2.

Thread can be closed…

Topic		Replies	Views
Shared memory bank conflicts CUDA Programming and Performance	1	2384	August 24, 2009
Does this have bank conflict? CUDA Programming and Performance	3	1527	October 31, 2008
CUDA Reduction CUDA Programming and Performance	2	1750	March 1, 2009
shared memory bank conflicts cc 2.0 CUDA Programming and Performance	3	892	December 29, 2011
Explanation of Shared Memory Bank Conflicts for Reduction Example? CUDA Programming and Performance	3	7721	March 14, 2010
Bank Conflict when each thread accesses 2 elements CUDA Programming and Performance	8	5577	July 9, 2010
Help understanding bank conflicts in transpose example CUDA Programming and Performance	5	6622	February 8, 2009
Question about bank conflict Chapter 5.1.2.4 of Cuda Prog. Guide CUDA Programming and Performance	2	2473	April 12, 2008
When bank conflicts in shared memory, serialized request is the order fixed? CUDA Programming and Performance cuda	4	25	August 12, 2024
bank conflicts...? CUDA Programming and Performance	2	3320	January 30, 2008