__shared__ char shared[32];
char data = shared[BaseIndex + tid];
Above code is the example that is given in the CUDA documentation, however, I’m not clear on why is:
shared[1], shared[2], shared[3], shared[4];
Shares the same memory banks… Well, it says that’s because if “s” is odd, then it will conflict, but I’d like to see some type of proof…? I can’t see this from all of the examples given…
Thanks,
Grady