Problem with 1D Texture and cudaAddressModeBorder

LorenS · July 15, 2013, 2:20pm

I am trying to use a 1D texture as a lookup table. I am setting cudaAddressModeBorder so that values out of range return 0, but that is not what I am getting. What am I doing wrong?

Here is a simple version of the code:

#include

float2 *lookUpTableData;
cudaArray *lookUpTable;

texture<float2, cudaTextureType1D, cudaReadModeElementType> texRef;

global void InitializeLookupTables(float2 *lookUpTableData)
{
int position = blockIdx.x * blockDim.x + threadIdx.x;
lookUpTableData[position].x = 2.0f * threadIdx.x;
lookUpTableData[position].y = 4.0f * threadIdx.x;
return;
}

global void TestLookupTables()
{
float2 lookUpTableValues;
for (int i=0; i<15; i++) {
lookUpTableValues = tex1D(texRef, 5.0f * i + 0.5f);
printf(“The value returned for value %d is %f.\n”, 5*i, lookUpTableValues.x);
}
}

int main() {
size_t lookUpTable_width = 51;
size_t lookUpTable_size = size_t (sizeof(float2) * lookUpTable_width);
cudaMalloc((void**) &lookUpTableData, lookUpTable_size);

cudaChannelFormatDesc channelDesc = cudaCreateChannelDesc<float2>();
cudaExtent length = make_cudaExtent(lookUpTable_width, 0, 0);
cudaMalloc3DArray(&lookUpTable, &channelDesc, length, cudaArrayDefault);
InitializeLookupTables<<<1, 51>>>(lookUpTableData);
cudaDeviceSynchronize();

cudaMemcpyToArray(lookUpTable, 0, 0, lookUpTableData, lookUpTable_size,  cudaMemcpyDeviceToDevice);
texRef.addressMode[0] = cudaAddressModeBorder;
texRef.filterMode = cudaFilterModeLinear;
texRef.normalized = false;

cudaBindTextureToArray(texRef, lookUpTable, channelDesc);

TestLookupTables<<<1, 1>>>();
cudaDeviceSynchronize();
return 0;

}

I also tried replacing the two lines

cudaExtent length = make_cudaExtent(lookUpTable_width, 0, 0);
cudaMalloc3DArray(&lookUpTable, &channelDesc, length, cudaArrayDefault);

with

cudaMallocArray(&lookUpTable, &channelDesc, lookUpTable_width, 0, cudaArrayDefault);

but get the same output.

Here is the output I get when running the program. I was expecting all values greater than 50 to print 0.0, not 100.

The value returned for value 0 is 0.000000.
The value returned for value 5 is 10.000000.
The value returned for value 10 is 20.000000.
The value returned for value 15 is 30.000000.
The value returned for value 20 is 40.000000.
The value returned for value 25 is 50.000000.
The value returned for value 30 is 60.000000.
The value returned for value 35 is 70.000000.
The value returned for value 40 is 80.000000.
The value returned for value 45 is 90.000000.
The value returned for value 50 is 100.000000.
The value returned for value 55 is 100.000000.
The value returned for value 60 is 100.000000.
The value returned for value 65 is 100.000000.
The value returned for value 70 is 100.000000.

pQB · July 16, 2013, 11:52am

Hello,

your experiment works as expected. With cudaAddressModeBorder the border value is returned.

I think you can not specify a value unless you do some trick, for example, adding a border of size one with the given value.

Take a look at [url]CUDA Runtime API :: CUDA Toolkit Documentation for the CUDA texture address modes

Best,
Pablo

LorenS · July 16, 2013, 5:55pm

Hi,

I must not be understanding the documentation then.

My understanding is that cudaAddressModeClamp returns the border value. And cudaAddressModeBorder should give me 0.0.

Here is what what the C Programming Guide version 5.0 says:

“The default addressing mode is to clamp the coordinates to the valid range: [0, N) for non-normalized coordinates and [0.0, 1.0) for normalized coordinates. If the border mode is specified instead, texture fetches with out-of-range texture coordinates return zero.”

Thanks,

Loren

njuffa · July 16, 2013, 6:48pm

As I had mentioned recently in a similar thread, there is a known CUDA driver bug where the Border mode is treated as ClampToEdge when non-normalized coordinates are used. I do not know which driver version was the first affected and likewise do not know which driver release will contain the fix.

You are correct that Border mode should return zero for an out-of-bounds texture access (the border value is fixed at 0 in CUDA, it is not settable).

pQB · July 17, 2013, 7:37am

You are right LorenS. I reviewed the documentation quickly and it was me who was confused. My apologies.

LorenS · July 18, 2013, 7:54pm

LorenS · July 18, 2013, 7:56pm

Thanks for the information. I did try to see if a similar topic has appeared, but I couldn’t get the search functions in the forum to return a reasonable list of topics to review.

Thanks,

Loren

Topic		Replies	Views
cudaAddressModeBorder does not return zero? CUDA Programming and Performance	2	1695	July 2, 2013
How to change the adress mode of a texture? CUDA Programming and Performance	14	3316	January 20, 2014
cudaAddressModeBorder CUDA Programming and Performance	6	23929	January 10, 2012
The boundary of texture CUDA Programming and Performance	5	6915	May 20, 2008
Texture clamp address mode does not work CUDA Programming and Performance	4	6267	August 12, 2010
Out of boundary Texture Reference get what? CUDA Programming and Performance	1	1790	February 7, 2008
Texfetch Coordinates out of bound CUDA Programming and Performance	1	1975	June 1, 2007
Texture question A 2 dimensional texture question CUDA Programming and Performance	4	3754	October 26, 2009
clamping texture to zero / adding a zero border CUDA Programming and Performance	5	2490	September 15, 2011
address mode in cuda 1d texture binding to linear memory CUDA Programming and Performance	0	723	November 2, 2015

Problem with 1D Texture and cudaAddressModeBorder

Related topics