Can someone tell me if linear memory that is bound to a texture and read via a texture reference is cached in the texture cache in the same way as reads from cudaArrays? I have read the programming guide, but it's not clear. It mentions that cudaArrays are optimised for texture fetching, though linear memory can also be read via texture fetches, so I'm not sure where the performance difference comes from.
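For reference, here is a minimal sketch of the two paths I'm comparing, using the legacy texture reference API; the names (texRefLinear, texRefArray, devPtr, cuArray) are just placeholders:

```cuda
#include <cuda_runtime.h>

texture<float, 1, cudaReadModeElementType> texRefLinear;  // bound to linear device memory
texture<float, 1, cudaReadModeElementType> texRefArray;   // bound to a cudaArray

__global__ void readLinear(float *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        out[i] = tex1Dfetch(texRefLinear, i);          // integer index, linear memory
}

__global__ void readArray(float *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        out[i] = tex1D(texRefArray, (float)i + 0.5f);  // float coordinate, cudaArray
}

void bindBoth(float *devPtr, cudaArray *cuArray, size_t bytes)
{
    // Path 1: texture reference bound directly to linear device memory
    cudaBindTexture(0, texRefLinear, devPtr, bytes);

    // Path 2: texture reference bound to a cudaArray
    cudaBindTextureToArray(texRefArray, cuArray);
}
```

The question is whether fetches through texRefLinear go through the same texture cache as fetches through texRefArray.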
Depending on your access pattern, I'd suggest experimenting to see which gives you better performance. I've seen cases where a 1D texture bound to linear memory gave higher performance for sequential accesses.
My tests confirm this. I tried to "cheat" the 2D cache by putting my 1D data into an M x M/N 2D array (where M is small). Performance was about 5% slower than using the simple 1D data bound to device memory. Plus, with the texture bound to device memory, you don't need to do Dev->Dev transfers to update the cudaArray (see the sketch below).
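To illustrate the update step I mean, here is a rough sketch under my assumptions; updateKernel, cuArray, devPtr, bytes, and n are hypothetical names:

```cuda
#include <cuda_runtime.h>

// Hypothetical kernel that writes new values straight into linear memory.
__global__ void updateKernel(float *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        data[i] *= 2.0f;   // placeholder update
}

void updateForNextPass(cudaArray *cuArray, float *devPtr, size_t bytes, int n)
{
    // cudaArray path: results produced in devPtr have to be copied into the
    // array before the next texturing kernel sees them (a Dev->Dev transfer).
    cudaMemcpyToArray(cuArray, 0, 0, devPtr, bytes, cudaMemcpyDeviceToDevice);

    // Linear-memory path: the texture is bound to devPtr itself, so updating
    // devPtr in place is enough -- no extra copy before the next launch.
    updateKernel<<<(n + 255) / 256, 256>>>(devPtr, n);
}
```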
Ah, I'm laughing: I went in search of that exact question and ended up back at my own thread. I would really appreciate an answer to this as well. Does the line above simply mean that the effective texture cache size per multiprocessor is 8 KB, period? I.e., regardless of texture dimensionality, even though it mentions 1D specifically?