Is coalescing access important to texture memory?

Hella_Yu · March 12, 2008, 6:29pm

From the tutorials I read, seems coalescing is always associated with global memory.

Texture memory is cached, while global memory is not. So is coalescing access meaningful to texture memory read/write as to global memory?

Thanks.

DenisR · March 12, 2008, 7:02pm

No, but it is useful to have some local coherence in the texture. So within a warp access elements that are close together, since the texture cache has some features to optimize that (a load from the texture cache probably fetches also the nearest neighbors so when you want to read those too, they are already in cache)

Hella_Yu · March 13, 2008, 12:51am

The texture memory is “physically” the same as global memory , right? So with cacheing, the first time fetching data from texture also benefits from coalescing, is that correct?

DenisR · March 13, 2008, 6:45am

I am not sure anybody outside of NVIDIA knows for sure, but I would guess so. Als when using 2D-textures there is 2D-locality, so there might be some other trick/technique employed.

Anyhow, when treating textures as a black box, I would like to quote Mr Anderson in saying: textures are very useful when you have almost coalesced accesses. But also for random access it should be useful.

Sarnath · March 13, 2008, 6:59am

btw, if accesses are always coalesced, you wont get any benefits from texture memory. It is better to leave it global. This was discussed sometime back in this forum.

Eri_Rubin · March 13, 2008, 9:14am

its even more definite if you can have all your accesses coalesced then you will get better performance using non texture memory.

Hella_Yu · March 13, 2008, 2:33pm

Interesting point… But why?

‘coalesed read from texture memory to shared memory’ should not be worse then 'coalesed read from global memory to shared memory, ’ if not better, right?

MisterAnderson42 · March 13, 2008, 3:38pm

Reading from textures requires the usage of a few extra registers for addressing and calling the texture unit. The extra register usage can change your occupancy, and the total throughput through the texture unit is slightly less than than from a coalesced read.

Due to the considerations, if you can coalesce it is advisable to do so instead of using the texture read, especially when performing multiple reads within a thread. The only exception is when reading 128-bit types where textures are faster than coalesced reads.

paulius · March 14, 2008, 6:00pm

That isn’t always true. There are apps that achieve a slightly higher memory throughput reading from textures (fully coalesced addressing, if it were done for gmem) and writing result to gmem.

Paulius

DenisR · March 14, 2008, 7:23pm

Paulius, is there a general guideline when this is the case? (for instance memory-bound kernels) Otherwise I need to rewrite quite a lot of code to see if this is true for my kernels.

Eri_Rubin · March 16, 2008, 8:57am

Thats interesting … do u know why ?

Topic		Replies	Views
When is it worth copying global to texture memory CUDA Programming and Performance	2	3365	July 7, 2008
When to use textures CUDA Programming and Performance	7	8132	February 12, 2008
For what case should I use texture memory? CUDA Programming and Performance	8	2674	May 26, 2010
Texture and Global Memory CUDA Programming and Performance	2	3846	July 11, 2007
Question about textures CUDA Programming and Performance	5	7839	May 9, 2008
Copy from texture memory to shared memory Confused about best transfer strategy CUDA Programming and Performance	4	1567	February 11, 2010
Benefits of Texture Memory couldnt use them... CUDA Programming and Performance	6	3210	February 13, 2008
Question about texture/shared memory enhance the computing efficiency CUDA Programming and Performance	3	5387	December 4, 2007
Reading data CUDA Programming and Performance	12	2702	July 18, 2011
Texture? Just a short lesson... CUDA Programming and Performance	5	2720	March 9, 2008

Is coalescing access important to texture memory?

Related topics