Putting a linearized 3D array in texture memory

Hi,

I’m using CUDA to accelerate a medical physics algorithm and I have been told that using textures for read-only arrays would be a good way to optimize it.

In a first attempt using only global memory, I decided to linearize the 3D array with respect to the x coordinate, thinking that larger block sizes in x would make neighboring threads fetch memory locations that are close together. It did not work as expected, probably because there is A LOT of divergence in the kernel anyway…
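For reference, the layout and indexing I'm using look roughly like this (just a sketch with made-up names; the real kernel does a lot more work per voxel):

__global__ void readDensity(const float *density, int nx, int ny, int nz)
{
    int x = blockIdx.x * blockDim.x + threadIdx.x;
    int y = blockIdx.y * blockDim.y + threadIdx.y;
    int z = blockIdx.z * blockDim.z + threadIdx.z;
    if (x >= nx || y >= ny || z >= nz) return;

    // x varies fastest, so adjacent threads in x read adjacent addresses,
    // which is what I hoped would give coalesced loads.
    int idx = (z * ny + y) * nx + x;
    float rho = density[idx];
    // ... ray tracing work using rho ...
}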

Since the array does not need to change during the computation, I was told that texture memory would be a good bet. I tried binding it as a 1D texture, but it’s probably way too big for that… it’s 128x128x128…

I would like to know what the best choice would be for me, since my array is already linearized: would it be better to put it in a 2D array and use the simple texture example as a reference, or to go with the fancier approach using the pitched pointer, channel and extent stuff? A rough sketch of what I think the latter involves is below.
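From what I can piece together from the docs, the fancier 3D route would look roughly like this (completely untested sketch; uploadDensity3D and h_density are made-up names, and I'm assuming float voxels and the texture object API):

#include <cuda_runtime.h>

cudaTextureObject_t uploadDensity3D(const float *h_density, int nx, int ny, int nz)
{
    // Channel format: one 32-bit float per voxel.
    cudaChannelFormatDesc chDesc = cudaCreateChannelDesc<float>();
    cudaExtent extent = make_cudaExtent(nx, ny, nz);

    // Opaque 3D array on the device, laid out for spatial locality.
    cudaArray_t cuArr;
    cudaMalloc3DArray(&cuArr, &chDesc, extent);

    // Copy the linearized host array (x fastest, then y, then z) into it.
    cudaMemcpy3DParms copyParams = {};
    copyParams.srcPtr   = make_cudaPitchedPtr((void *)h_density,
                                              nx * sizeof(float), nx, ny);
    copyParams.dstArray = cuArr;
    copyParams.extent   = extent;
    copyParams.kind     = cudaMemcpyHostToDevice;
    cudaMemcpy3D(&copyParams);

    // Describe how the kernel will sample it.
    cudaResourceDesc resDesc = {};
    resDesc.resType = cudaResourceTypeArray;
    resDesc.res.array.array = cuArr;

    cudaTextureDesc texDesc = {};
    texDesc.addressMode[0] = cudaAddressModeClamp;
    texDesc.addressMode[1] = cudaAddressModeClamp;
    texDesc.addressMode[2] = cudaAddressModeClamp;
    texDesc.filterMode = cudaFilterModePoint;  // cudaFilterModeLinear for trilinear interpolation
    texDesc.readMode = cudaReadModeElementType;
    texDesc.normalizedCoords = 0;

    cudaTextureObject_t densityTex = 0;
    cudaCreateTextureObject(&densityTex, &resDesc, &texDesc, NULL);
    return densityTex;
}

// In the kernel the read would then be:
//   float rho = tex3D<float>(densityTex, x + 0.5f, y + 0.5f, z + 0.5f);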

Obviously I’m more of a physicist than a programmer so something simple would be great.

Thanks

edit: To be more precise, it’s a radiation raytracing algorithm that needs to read a (3D) density map along the way…

From the vague description it sounds like reading through the texture path would indeed be appropriate. But on modern GPUs, you don’t need to set up explicit textures to take advantage of that. Instead, look into using the __ldg() intrinsic (see documentation).

__ldg() maps to an LDG machine instruction that reads through the texture path. While the compiler can also use LDG automatically, if you want to be absolutely sure that LDG will be used, use the intrinsic. Note that if the compiler happens to already use LDG everywhere, you won’t see a performance increase over your current code. You can use cuobjdump --dump-sass to look at the generated machine code before/after.
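For example, something along these lines (a minimal sketch only; traceRays, dose and the indexing are placeholders standing in for your actual kernel, and __ldg() requires compute capability 3.5 or later, so compile for at least -arch=sm_35):

// Force reads of the read-only density map through the texture path with __ldg().
// The const __restrict__ qualifiers also help the compiler prove the data is read-only.
__global__ void traceRays(const float * __restrict__ density,
                          float * __restrict__ dose,
                          int nx, int ny, int nz)
{
    int x = blockIdx.x * blockDim.x + threadIdx.x;
    int y = blockIdx.y * blockDim.y + threadIdx.y;
    int z = blockIdx.z * blockDim.z + threadIdx.z;
    if (x >= nx || y >= ny || z >= nz) return;

    int idx = (z * ny + y) * nx + x;
    float rho = __ldg(&density[idx]);   // load through the read-only/texture path

    // ... actual ray-tracing work would go here ...
    dose[idx] = rho;                    // placeholder write
}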