confusion about 64 bit shared memory access

King_Crimson · May 6, 2012, 6:21pm

On the CUDA programming guide v4.2 section F.4.3.2, it says:

[i]64-Bit Accesses

For 64-bit accesses, a bank conflict only occurs if two threads in either of the half-warps access different addresses belonging to the same bank.

Unlike for devices of compute capability 1.x, there are no bank conflicts for arrays of doubles accessed as follows, for example: [/i]
extern __shared__ float shared[]; 

double data = shared[BaseIndex + tid];

question 1: is “float” a typo? shouldn’t it be “double”?

question 2: does it imply that the 64 bit memory access request is for half-warp rather than the entire warp? otherwise, access to, say, shared[0] and shared [16] by thread 0 and 16 is supposed to incur bank conflict, right?

Thanks for clarification! External Image

cudaDMA · May 10, 2012, 9:09pm

Banks conflicts are looked at half-warp level.

Topic		Replies	Views
shared memory bank conflicts cc 2.0 CUDA Programming and Performance	3	892	December 29, 2011
shared memory accesses for different compute capabilities CUDA Programming and Performance	2	2839	July 29, 2011
Understanding bank conflicts in shared memory (fermi) CUDA Programming and Performance	4	11533	August 16, 2010
dont understand bank conflicts for shared mem CUDA Programming and Performance	7	2611	March 31, 2010
do not understand bank conflicts please help CUDA Programming and Performance	7	2685	December 22, 2012
Shared Memory "Bank Conflicts" I'am confused... CUDA Programming and Performance	11	3457	August 20, 2009
bank conflict in cuda's parallel prefix scan GPU-Accelerated Libraries	1	1885	February 12, 2016
Shared memory coalescing Q:Is it a typo in the programming guide? CUDA Programming and Performance	2	2482	August 20, 2007
shared memory bank conflicts when reading? CUDA Programming and Performance	5	2545	August 3, 2007
Bank Conflicts CUDA Programming and Performance	2	1956	December 6, 2009

confusion about 64 bit shared memory access

Related topics