I have a texture of 20 floats (4 by 5 array). Since it is so small, I am expecting it all to
be resident in cache. (However there are much bigger textures as well.)
I use the values it contains about 16 times per thread. Any ideas when it is
worth buffering by reading into a local variable and using that rather
than calling tex1Dfetch() every time?
The texture is treated as a 2D array–which involes some index calculation.
Any help or comments would be most welcome