Texture and Global Memory

pototschnig · July 9, 2007, 8:50am

Hi,

as I know the texture memory is cached and the global memory not.

What does make more sense?

load data into texture memory, perform operation, output is in global memory because texture memory is read only. Then copy from global memory into texture memory for the next step and so on …

or:

load data into texture memory once and then only operate on global memory.

or:

don’t use texture memory if I don’t need features like interpolating or reading uchars as normalized floats?

I can’t estimate how much different this will make in speed.

But if I understand this right: If I copy data from global memory to texture memory I have to access every pixel twice because I need to get the data out of the texture memory again.

So it would be better to avoid device to device copy?

regards
Pototschnig

asadafag · July 10, 2007, 3:14am

You can directly bind global memory to 1D textures, but “which is better” depends on your access pattern.
My test program shows time cost of:
coalesced global memory read < texture fetch < not-coalesced global memory read
so if you read memory randomly, or may access the same address in multiple threads, or have to read unaligned memory, you may want to use a texture. Otherwise, global memory could be better.

Morph208 · July 11, 2007, 7:15am

Yes it’s gonna depend of your algorithm. It’s hard to tell. In addition of what asadafag said, I’ve improved by a factor 2 my algorithm by stopping device-device memory. In the beginning I was binding textures to cudaArrays (because texture fetching are optimized for cudaArray). Since it’s not possible to address directly cudaArray within a kernel, I was using a buffer in global memory (allocated with cudaMalloc) and working with it from my kernel. And at the end I was copying the buffer to my cudaArray with a MemcpyToArray. Now I’m directly binding a texture to my buffer. So no need from device-device copy anymore. In my case, it’s much faster.

Topic		Replies	Views
When to use textures CUDA Programming and Performance	7	8115	February 12, 2008
Question about textures CUDA Programming and Performance	5	7833	May 9, 2008
When is it worth copying global to texture memory CUDA Programming and Performance	2	3359	July 7, 2008
Texture vs Global memory which of this is faster? CUDA Programming and Performance	2	5460	August 18, 2011
Texture vs. Global Memory CUDA Programming and Performance	4	2010	August 6, 2009
Is coalescing access important to texture memory? CUDA Programming and Performance	10	12802	March 16, 2008
Copy from texture memory to shared memory Confused about best transfer strategy CUDA Programming and Performance	4	1545	February 11, 2010
Use texture memory or global memory in this case? CUDA Programming and Performance	3	1980	August 13, 2016
Texture memory when to use ? CUDA Programming and Performance	6	20141	October 7, 2009
textures how do they work CUDA Programming and Performance	1	3360	September 18, 2008

Texture and Global Memory

Related topics