How to elegantly handle double arrays in shared memory without inducing bank conflict?

edwardliang11 · July 28, 2019, 1:35pm

Hi,

I am trying to process some double arrays in the shared memory. I need to access each element by linear addressing, namely, I need the fisrt thread to access the first element, the second thread to access the second element, etc. It seems to me that this will induce a two-way bank conflict. How can I resolve this issue?

Please note that I need high precision for my computations, so replacing double with float is not an option for me.

cbuchner1 · July 28, 2019, 2:36pm

You’re not stating which hardware (compute architecture) you’re developing on, which is critical information to make a meaningful performance related suggestion.

A related thread may be: https://devtalk.nvidia.com/default/topic/1039256/selecting-the-8-bytes-banks-of-shared-memory/

My understanding is that on Kepler you need to select 8 byte shared memory bank mode, and on later architectures (Maxwell, Kepler, Pascal, Volta) this is no longer supported or necessary.

Topic		Replies	Views
bank conflict in fermi for doubles CUDA Programming and Performance	0	1278	June 17, 2010
64-bit shared memory with minimal bank conflict? CUDA Programming and Performance	4	2642	March 21, 2016
Understanding bank conflicts in shared memory (fermi) CUDA Programming and Performance	4	11629	August 16, 2010
confusion about 64 bit shared memory access CUDA Programming and Performance	1	1306	May 10, 2012
Selecting the 8 bytes banks of shared memory CUDA Programming and Performance	9	3672	January 11, 2021
Bank Conflict when each thread accesses 2 elements CUDA Programming and Performance	8	5739	July 9, 2010
Shared memory bank conflict CUDA Programming and Performance	4	623	July 30, 2025
How are types larger than 4 bytes stored in shared memory, and how does this relate to bank conflicts CUDA Programming and Performance	6	259	April 20, 2025
Resolve 1D shared memory bank conflict with paddling CUDA Programming and Performance cuda , kernel	9	420	September 1, 2024
Bank conflicts with 2D shared mem array Resolving bank conflicts CUDA Programming and Performance	1	2069	July 18, 2008

How to elegantly handle double arrays in shared memory without inducing bank conflict?

Related topics