How to handle a set(array) of 2D textures

CudaaduC · September 15, 2015, 12:44am

For the first time would like to try to use the linear interpolation feature of 2D texture memory.

This link was useful for the case where there is one 2D texture;

[url]http://on-demand.gputechconf.com/gtc-express/2011/presentations/texture_webinar_aug_2011.pdf[/url]

In my use case I will have an input set of 60 matrices (float) of example size (500,700).
I would like to handle 10 such matrices per kernel launch, and would attempt to have 10 distinct textures, in a array form(if possible) like this;

texture<float, cudaTextureType2D, cudaReadModeElementType> tex[10];

Then bind the current batch of 10 (500,700) matrices to that set of textures before each kernel call, use, unbind and repeat process until done.

Since I only need to do the interpolation across (x,y) I believe I should not use the 3D textures because their interpolation is across(x,y,z).

I already implemented a working application using __ldg() and my own interpolation, but since this is a built in feature of CUDA textures thought that there might be a faster approach.

Did Google this topic before posting, and could not find a specific answer or( even more useful ) a working example. Using textures in such a manner has a more complicated set-up process than handling standard device memory.

How would I go about doing this and would it result in better performance than using __ldg()?

njuffa · September 15, 2015, 1:06am

You will find out that (unless things have changed drastically since I last looked) you cannot make an array of textures like your code above shows.

What I have done before is use two textures and select the appropriate one at access time based on coordinates. That approach does not scale to ten textures in high-performance fashion, obviously. If the individual textures have identical sizes, you may be able to combine the data for multiple of them into one larger texture, similar to the way the graphics guys do this.

I am not sure why you would dismiss 3D textures right away. I have used 2D textures before where I just needed to interpolate in the x-dimension. Similarly you can use a 3D texture and just interpolate in one or two dimensions.

Robert_Crovella · September 15, 2015, 4:03am

If you’re working on cc3.0 or higher hardware, you can make an array of texture objects. As njuffa points out, an array of texture references is difficult or impossible.

The answer to this question may be of interest:

[url]c++ - Cuda Create 3d texture and cudaArray(3d) from device memory - Stack Overflow

It happens to demonstrate an array of 3D textures (i.e. objects), but it should be straightforward to convert it to an array of 2D textures (objects).

If you simply want an example that shows 2D interpolation, I think there are one or more of those in the CUDA samples. The bindless texture and simple texture 3D examples may be of interest.

cbuchner1 · September 15, 2015, 9:17am

An alternative to hundreds of texture object handles (each of which requires 8 bytes), would be to use layered cudaArrays, bound to a single texture object. The one drawback is that you have to know in advance what the maximum number of layers will be. Any dynamic resizing requires the creation of a new cudaArray, and re-upload of all the layers.

Christian

HannesF99 · September 15, 2015, 12:19pm

Use texture objects and wrap them in a c++ class.

See
http://devblogs.nvidia.com/parallelforall/cuda-pro-tip-kepler-texture-objects-improve-performance-and-flexibility/

Topic		Replies	Views
An array of texture references? CUDA Programming and Performance	30	29721	October 29, 2007
Overlap of Data Transfer and Kernel Execution CUDA Programming and Performance	3	1380	March 4, 2011
Why there is no cudaBindTexture3D? It would be nice to have this ... CUDA Programming and Performance	9	4664	December 12, 2009
Array of texture references CUDA Programming and Performance	3	5392	April 20, 2011
Repeated 1D interpolation with type promotion CUDA Programming and Performance	3	568	October 12, 2021
texture array or texture pointer want to dynamic allocate texture CUDA Programming and Performance	5	4210	April 28, 2008
What's the instruction throughput for texture fetches? How to refresh texture object without recopying? CUDA Programming and Performance	5	867	April 26, 2018
Understanding CUDA texture 2D linear interpolation CUDA Programming and Performance	4	2613	May 10, 2022
3D Geographic Interpolation too inaccurate How to best deal with poor texture interpolation? CUDA Programming and Performance	9	1377	December 19, 2024
Array of texture references CUDA Programming and Performance	8	8353	April 16, 2009

How to handle a set(array) of 2D textures

Related topics