It says in the Appendix A of the programming guide (A.1.1) that “the amount of shared memory available per multiprocessor is 16KB organized into 16 banks.”
So each bank has 1KB of memory space available, right ?
At 5.1.2.5 in the programming guide, it says : “In the case of the shared memory space, the banks are organized such that successive 32-bit words are assigned to successive banks and each bank has a bandwidth of 32 bits per two clock cycles.”
Does it mean that only 32 bits = 4 Bytes among the 1KB available per bank is used ?
Thanks for your concern and sorry if this question has already been discussed. I couldn’t find any related topic at first search.