texture memory binding performance

yaobin · May 28, 2012, 5:57am

I am implementing a image processing project. It needs a real time processing about 30 frames per second. One frame image is about 2007004bytes.

I bind one frame of image in the global memory to texture memory, it takes about 7ms for binding. 7ms is a long time for real time processing. I don’t know if there is anything wrong? or it’s real that binding a image to texture memory needs to take a such long time.

Any help or suggestion will be helpful, thanks

tera · May 28, 2012, 7:39am

You don’t need to bind and unbind the texture for each frame. You can just copy new data to it and invoke a new kernel.

yaobin · May 29, 2012, 4:36pm

Thanks for reply. can you show me an example?

tera · May 31, 2012, 5:47pm

Sorry, I don’t have an example handy. What I meant was that

[*]copy data to GPU array

[*]bind texture to GPU array

[*]call kernel

[*]unbind texture

[*]copy data to GPU array

[*]bind texture to GPU array

[*]call kernel

[*]unbind texture

[*]potentially more iterations…

can safely be replaced by

[*]copy data to GPU array

[*]bind texture to GPU array

[*]call kernel

[*]copy data to GPU array

[*]call kernel

[*]potentially more iterations…

[*]unbind texture

rooobosmith · June 13, 2012, 6:14pm

Do any changes to the GPU array appear immediately in the bound texture?

For example:

[*]copy data to GPU array

[*]bind texture to GPU array

[*]call kernel with arg ptr to GPU array which modifies it

Does the bound texture immediately reflect the modifications within the kernel?

njuffa · June 13, 2012, 6:35pm

Due to the non-coherent nature of the texture cache, changes to underlying storage in a kernel may or may not be visible when accessing through the texture path in the same kernel. In essence, the behavior is undefined.

Before every kernel launch, the texture cache is flushed, so reading through a texture in a kernel will correctly reflect changes to underlying storage made by a previous kernel, or by a CUDA API call preceeding the kernel. This is the scenario tera was showing in his pseudo code.

[Later:] See section 3.2.10.4 of the CUDA C Programming Guide

Topic		Replies	Views
Unbind and rebind texture CUDA Programming and Performance	3	6120	January 15, 2009
dynamic update of texture in a kernel is it worth it CUDA Programming and Performance	4	1555	May 11, 2009
Texture bind reuse or rebind CUDA Programming and Performance	4	490	June 22, 2022
CUDA texture memory performance CUDA Programming and Performance	4	33540	January 13, 2009
RT video processing: Use texture fetches or not? question about using tecture cache CUDA Programming and Performance	4	3212	August 18, 2008
Texture binding CUDA Programming and Performance	1	756	April 15, 2009
Texture memory fetch extremely slow CUDA Programming and Performance	13	3099	December 21, 2017
cudaBindTexture synchronization question CUDA Programming and Performance	2	4360	November 26, 2013
texture memory performance on a multiGPU system takes too much time to setup a texture for some GPUs CUDA Programming and Performance	9	1391	February 16, 2012
Texture Memory ! CUDA Programming and Performance	3	7163	January 11, 2010

texture memory binding performance

Related topics