Understanding bank conflicts in shared memory (fermi)

crip_crop · August 14, 2010, 6:28pm

Hello there,

I just need someone to clarify that my understanding of bank conflicts in shared memory on the Fermi is correct. It says in the CUDA Programming Guide 3.1, that there can now be bank conflicts between threads in different half-warps in GPUs of compute capability 2.0. Is this because the number of banks has increased to 32, so two half-warps can access the banks at the same time. Hence, there’s potential for 2 threads in the different half-warps to access the same bank? Or is there another explanation, which I have missed.

I’m also a little unsure about why doubles are subject to 2-way bank conflicts in shared memory for compute capability 1.3. If anyone is a whizz at this and can explain it to me I would be very grateful indeed.

Cheers,
Crip-crop

crip_crop · August 14, 2010, 6:34pm

Actually, I’ve just worked out why doubles suffer 2-way bank conflicts. It’s because the doubles are split into 2 32-bit words and put into successive banks. So, for a half-warp accessing 16 doubles, there will be 2 threads accessing each each bank.

Still unclear on the first point however, so please reply with possible explanations.

Cheers,
Crip_crop

crip_crop · August 14, 2010, 6:34pm

Actually, I’ve just worked out why doubles suffer 2-way bank conflicts. It’s because the doubles are split into 2 32-bit words and put into successive banks. So, for a half-warp accessing 16 doubles, there will be 2 threads accessing each each bank.

Still unclear on the first point however, so please reply with possible explanations.

Cheers,
Crip_crop

ONeill · August 16, 2010, 10:18am

You r correct with your thought on the first point. Pre-Fermi accesses were handled per half-warp so it didnt matter if say thread 0 and 16 accessed the same bank. This has changed with Fermi as you said.

ONeill · August 16, 2010, 10:18am

You r correct with your thought on the first point. Pre-Fermi accesses were handled per half-warp so it didnt matter if say thread 0 and 16 accessed the same bank. This has changed with Fermi as you said.

Topic		Replies	Views
How to understand the bank conflict of shared_mem CUDA Programming and Performance	12	10934	January 16, 2025
Shared Memory Bank Conflict Clarification CUDA Programming and Performance	2	777	April 16, 2011
Shared memory bank conflict CUDA Programming and Performance	1	308	May 19, 2024
Does this code cause bank conflicts? Nsight Compute cuda , kernel	4	1349	September 6, 2024
Requesting clarification for Shared Memory Bank Conflicts and Shared memory access? CUDA Programming and Performance hw , cuda	11	4273	January 23, 2024
Trade-off Between Bank Conflict and Thread Count in Shared Memory Access CUDA Programming and Performance cuda	9	63	June 23, 2025
float4 Shared memory doesn't yield bank conflict according to nvprof when it should CUDA Programming and Performance	4	1941	January 13, 2024
Shared memory bank conflicts CUDA Programming and Performance	1	2395	August 24, 2009
128-bit access bank conflict CUDA Programming and Performance	11	1009	March 29, 2024
Why there is random bank conflicts? CUDA-MEMCHECK cuda	2	1211	September 19, 2023

Understanding bank conflicts in shared memory (fermi)

Related topics