I have a kernel that I would like to perform both coalesced reads and writes that are to and from the same global memory array. However, I have been unable to to this unless my writes are coalesced from shared memory, to an array in global memory, that is diffrent from the global memory array for which I coalesces my reads from.
Is there a way aound this?
Also, is there a way to check if I am trueley coalescing (besides keeping track of the numbers)? Is there a function or a Macro or a message I can check for?