I have read the Programming Guide and Performance Guidelines, but shared memory bank conflict is still not clear to me.
- At any point of time, on a streaming multiprocessor, aren’t there more than 1 warp per block being executed? If no, that clarifies everything. If yes -
- Isn’t it highly likely that one bank is being accessed by more than 1 thread, since number of banks <<< number of possible threads.
1)If there is only 1 warp executed per block at any given time, then isn’t its implication that increasing the number of cores does not increase efficiency of solutions to fine grained parallelism problems?