Performance Considerations using Texture Access: Does the performance depend on the access pattern?

CapJo · August 20, 2009, 8:35pm

The texture cache is quite small (8 KB) and shared by 3 multiprocessors. Cached data will soon be pushed out by new data.

Is there more information about how the cache really works?

How is it organized? Does the cache use “cache lines” like on the CPU? What is meant by locality, and in which directions does it apply (for 2D access)?

Which access pattern should I use to get the most cache hits and the best performance?

MisterAnderson42 · August 21, 2009, 12:09pm

The organization depends on how you set up the texture. You can bind a texture directly to global memory for 1D locality, or to a cudaArray for 1D, 2D or 3D locality.

It has been said that the new “pitch-linear” 2D texture bound to global memory still has only 1D locality, but I haven’t written a microbenchmark to test that for myself yet.
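To make "1D locality" versus "2D locality" a bit more concrete, here is a small host-side toy model in Python (not CUDA code). It counts how many cache lines a warp-sized patch of texels touches under a pitch-linear layout and under a tiled layout. Everything numeric here is an assumption for illustration: the 16-texel line, the 4x4 tile and the names `linear_index`, `tiled_index` and `lines_touched` are all hypothetical, and the real cudaArray layout is not documented.

```python
LINE_ELEMS = 16          # hypothetical: 16 floats (64 bytes) per cache line
TILE_W, TILE_H = 4, 4    # hypothetical tile shape; TILE_W * TILE_H == LINE_ELEMS


def linear_index(x, y, width):
    """Pitch-linear layout: rows are stored one after another."""
    return y * width + x


def tiled_index(x, y, width):
    """Toy 2D layout: each TILE_W x TILE_H tile is contiguous in memory.

    Assumes width is a multiple of TILE_W. This is a stand-in for an
    opaque 2D-friendly layout, not the actual cudaArray format.
    """
    tiles_per_row = width // TILE_W
    tile = (y // TILE_H) * tiles_per_row + (x // TILE_W)
    within = (y % TILE_H) * TILE_W + (x % TILE_W)
    return tile * (TILE_W * TILE_H) + within


def lines_touched(index_fn, w, h, width=256):
    """Distinct cache lines touched by a w x h patch of texels at the origin."""
    return len({index_fn(x, y, width) // LINE_ELEMS
                for y in range(h) for x in range(w)})


if __name__ == "__main__":
    patches = {"row 32x1": (32, 1), "column 1x32": (1, 32), "block 8x4": (8, 4)}
    for name, (w, h) in patches.items():
        print(name,
              "linear:", lines_touched(linear_index, w, h),
              "tiled:", lines_touched(tiled_index, w, h))
```

In this model the linear layout is ideal for a row of 32 texels (2 lines) but needs 32 lines for a column, while the tiled layout needs 8 lines for either direction and only 2 for a compact 8x4 block. That is the sense in which a 2D layout trades some 1D efficiency for locality in both directions.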

As for the access pattern: just what it says in the programming guide. The best use of the texture cache is to have spatially local accesses among the threads in each warp.
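A rough way to see why locality within a warp matters, and why a small cache loses its contents quickly, is to push a few access patterns through a toy cache model. The sketch below is host-side Python, not CUDA. It assumes a fully associative LRU cache with 64-byte lines and the 8 KB size quoted above. The real texture cache's line size, associativity and replacement policy are not stated in this thread, so `LINE_BYTES`, `LRUCache` and `hit_rate` are purely illustrative.

```python
from collections import OrderedDict

LINE_BYTES = 64          # hypothetical cache-line size
CACHE_BYTES = 8 * 1024   # the 8 KB figure quoted in this thread
TEXEL_BYTES = 4          # one float per texel


class LRUCache:
    """Toy fully associative cache with least-recently-used eviction."""

    def __init__(self, num_lines):
        self.num_lines = num_lines
        self.lines = OrderedDict()
        self.hits = 0
        self.misses = 0

    def access(self, line):
        if line in self.lines:
            self.lines.move_to_end(line)        # mark as most recently used
            self.hits += 1
        else:
            self.misses += 1
            self.lines[line] = True
            if len(self.lines) > self.num_lines:
                self.lines.popitem(last=False)  # evict least recently used


def hit_rate(texel_indices):
    """Fraction of reads that hit, for texel indices given in issue order."""
    cache = LRUCache(CACHE_BYTES // LINE_BYTES)
    for i in texel_indices:
        cache.access(i * TEXEL_BYTES // LINE_BYTES)
    return cache.hits / (cache.hits + cache.misses)


if __name__ == "__main__":
    threads = range(64 * 32)  # 64 warps of 32 threads, one read per thread
    # Neighbouring threads read neighbouring texels.
    print("local:    ", hit_rate(i for i in threads))
    # Each thread reads a texel 16 elements away from its neighbour's.
    print("scattered:", hit_rate(i * 16 for i in threads))
    # Re-reading a region that fits in the cache versus one twice its size.
    print("fits:     ", hit_rate(list(range(2048)) * 2))
    print("too large:", hit_rate(list(range(4096)) * 2))
```

Under these assumptions the neighbouring-texel pattern hits on 15 of every 16 reads, while the scattered pattern never hits because every thread lands on a different line. The last two cases show the eviction effect: a second pass over a region that fits in the cache hits every time, but a second pass over a region twice the cache size finds its lines already pushed out.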

For longer and more explicit descriptions: search the forums for the many other posts on the texture cache by me.

http://www.google.com/search?client=safari…-8&oe=UTF-8
