Does this have bank conflict?

casybaby · October 30, 2008, 10:36pm

Hi all,

I have an array A in my shared memory, each thread reads two neighbor elements of them. For example, thread 0 reads A[0], A[1], thread 1 reads A[1], A[2]. Does it have bank conflict?

Thank you.

Casy

QD4_33 · October 31, 2008, 3:49pm

Yes, you have a bank conflict at A[1] …

When I try to access memory like this ( for example in a reduction) I use indexes like the following:

n is the number of elements:
Thread[ i ]: A[ i ] and A[ i + n >> 1 ]
… ( n >> 1 is the same as n / 2 if n is a power of 2 )

You also can use something like

unsigned int i = blockIdx.x * ( blockDim.x * 2 ) + threadIdx.x; // TODO adept
unsigned int ib = i + blockDim.x;
Thread[ i ] accesses A[ i ] and A[ ib ]

This options don’t have bank conflicts but the number of elements should be even.
( Have a look at SDK’s reduction sample! ;) )

pstach · October 31, 2008, 8:10pm

This response makes some assumptions which are incorrect.

Let say we have the example of:

__shared__ uint foo[BLOCKSIZE + 1];

uint val1, val2;

val1 = foo[threadIdx.x];

val2 = foo[threadIdx.x + 1];

There is no bank conflict.

Now if you were doing something along the lines of:

__shared__ uint foo[BLOCKSIZE + 1];

uint2 val;

val = (uint2 *) &foo[threadIdx.x];

This would create a bank conflict.

alex_dubinsky · October 31, 2008, 10:09pm

This response makes some assumptions which are incorrect.

Let say we have the example of:
__shared__ uint foo[BLOCKSIZE + 1];

uint val1, val2;

val1 = foo[threadIdx.x];

val2 = foo[threadIdx.x + 1];
There is no bank conflict.

Now if you were doing something along the lines of:
__shared__ uint foo[BLOCKSIZE + 1];

uint2 val;

val = (uint2 *) &foo[threadIdx.x];
This would create a bank conflict.

There would be a quote-unquote “bank conflict.” But in fact both scenarios will simply execute in two cycles.

Topic		Replies	Views
Problems Understanding Bank Conflicts CUDA Programming and Performance	1	1712	September 16, 2009
bank conflict question CUDA Programming and Performance	3	2287	December 28, 2009
CUDA Reduction CUDA Programming and Performance	2	1750	March 1, 2009
shared memory bank conflicts cc 2.0 CUDA Programming and Performance	3	892	December 29, 2011
bank conflicts...? CUDA Programming and Performance	2	3320	January 30, 2008
Bank Conflict when each thread accesses 2 elements CUDA Programming and Performance	8	5577	July 9, 2010
shared memory bank conflicts when reading? CUDA Programming and Performance	5	2545	August 3, 2007
Help understanding bank conflicts in transpose example CUDA Programming and Performance	5	6622	February 8, 2009
Bank conflicts on same address CUDA Programming and Performance	5	2833	April 28, 2009
Will this code cause bank conflict ? CUDA Programming and Performance	1	446	October 9, 2018

Does this have bank conflict?

Related topics