The texture cache is quite small (8 K) and shared by 3 multiprocessors. Cached data will pushed out soon by new data.
Are there more information about how the cache is really working … ?
How is it organized? Is the cache using “cache-lines” like on the cpu and what is ment by locality and
in which directions (2D-access)?
Which access pattern should I use to get most cache hits and the best performance?