One main conclusion I took home from this year CUDA workshop in Dresden was that memory access to global memory is NOT cached.
Now I read a paper from Govindaraju, et al. (see here), stating that the G80 has a Cache size of 392KB.
So memory access to global memory is cached after all?
Thanks and best regards,