texture memory copying wrong data

rachit · February 25, 2010, 7:36am

Hi, I am writing a convolution program that works fine otherwise but produces wrong results when I use texture memory.

I managed to solve the syntax and compilation issues, but I am probably missing something that causes this problem.

I have a 5x5 matrix named N that I want to copy to texture memory. Its elements are as follows:

[N] = 2   3   4   5   6
      3   4   5   6   7
      4   5   6   7   8
      5   6   7   8   9
      6   7   8   9   10

When I read the values in the device code (before the computation), they come out something like 2, 2.5, 3.5, 4.5, 5.5, 6.5, 7.5, 8.5, 9.5.

I can't understand why. Please help, thanks…

#include<stdio.h>
#include<cuda.h>

texture<float, 1, cudaReadModeElementType> tex;

__global__ void MatrixMulKernel(float* M, float* P, int Width)
{
	float Pvalue = 0;
	int modval = (Width - 1) / 2;
	for (int k = 0; k < Width; ++k)
	{
		for (int j = 0; j < Width; ++j)
		{
			float Melement = M[k*Width+j];
			float Nelement = 0;
			if (threadIdx.y-modval+k >= 0 && threadIdx.y-modval+k < 5 && threadIdx.x-modval+j >= 0 && threadIdx.x-modval+j < 5)
			{
				Nelement = tex1D(tex, float((threadIdx.y-modval+k)*5+(threadIdx.x-modval+j)));
				printf("k = %d\tj = %d\ttx = %d\tty = %d\tnelem = %f\n", k, j, threadIdx.x, threadIdx.y, Nelement);
			}
			Pvalue += Melement * Nelement;
		}
	}
	P[threadIdx.y*5+threadIdx.x] = Pvalue;
}

int main()
{
	float M[3][3], N[5][5], P[5][5];
	float *Md, *Nd, *Pd;
	int height = 5;
	int width = 5;
	int wd = 3;
	int size = width * height * sizeof(float);

	cudaMalloc((void **)&Md, 9*sizeof(float));
	cudaMalloc((void **)&Nd, 25*sizeof(float));
	cudaMalloc((void **)&Pd, size);

	int i, j;
	for (i = 0; i < 3; i++)
		for (j = 0; j < 3; j++)
		{
			M[i][j] = i+j+1;
		}
	for (i = 0; i < 5; i++)
		for (j = 0; j < 5; j++)
		{
			N[i][j] = i+j+2;
			P[i][j] = 0;
		}

	cudaMemcpy(Md, &M, 9*sizeof(float), cudaMemcpyHostToDevice);

	cudaArray* cuArray;
	cudaMallocArray(&cuArray, &tex.channelDesc, width*height, 1);
	cudaMemcpyToArray(cuArray, 0, 0, &N, sizeof(float)*width*height, cudaMemcpyHostToDevice);
	// bind a texture to the CUDA array
	cudaBindTextureToArray(tex, cuArray);
	tex.normalized = false;
	tex.filterMode = cudaFilterModeLinear;

	dim3 dimGrid(1, 1);
	dim3 dimBlock(5, 5);
	MatrixMulKernel <<< dimGrid, dimBlock >>> (Md, Pd, wd);

	cudaMemcpy(&P, Pd, sizeof(float)*width*height, cudaMemcpyDeviceToHost);

	for (i = 0; i < 5; i++)
	{
		for (j = 0; j < 5; j++)
		{
			printf("P[%d][%d] = %f\t", i, j, P[i][j]);
		}
		printf("\n");
	}

	cudaUnbindTexture(tex);
	cudaFreeArray(cuArray);
	cudaFree(Md);
	cudaFree(Nd);
	cudaFree(Pd);
	return 0;
}

fcs · February 25, 2010, 3:58pm

You set "tex.filterMode = cudaFilterModeLinear;", so the texture unit linearly interpolates between neighboring texels on every fetch.
If that is really what you want, you have to add 0.5 to each floating-point coordinate to get the exact stored values, because texel centers sit at half-integer coordinates (for example: tex2D(tex, x + 0.5f, y + 0.5f)).
Otherwise, just set tex.filterMode to cudaFilterModePoint.

Hope that helps
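
For anyone who wants to see where 2, 2.5, 3.5 comes from, here is a rough host-side sketch of how a 1D linearly filtered fetch behaves with unnormalized coordinates and clamp addressing, following the filtering formula in the CUDA programming guide. The names emulate_tex1d_linear and make_flattened_n are made up for this illustration, and the sketch ignores the reduced-precision interpolation weight that real hardware uses, so treat it as an approximation rather than what the GPU does bit for bit.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <vector>

// Hypothetical helper: emulates tex1D() with cudaFilterModeLinear,
// unnormalized coordinates and clamp addressing. Assumption: the
// interpolation weight is kept in full float precision, unlike hardware.
float emulate_tex1d_linear(const std::vector<float>& texels, float x) {
    assert(!texels.empty());
    const int last = static_cast<int>(texels.size()) - 1;

    const float xb = x - 0.5f;            // texel i is centered at i + 0.5
    const float fl = std::floor(xb);
    const float alpha = xb - fl;          // weight of the right-hand neighbor

    // Clamp both neighbor indices to the valid range [0, last].
    const int i0 = std::max(0, std::min(static_cast<int>(fl), last));
    const int i1 = std::max(0, std::min(static_cast<int>(fl) + 1, last));

    return (1.0f - alpha) * texels[i0] + alpha * texels[i1];
}

// Hypothetical helper: the 5x5 matrix N from the question, stored row by
// row in one 25-element vector, the same layout the cudaArray receives.
std::vector<float> make_flattened_n() {
    std::vector<float> n;
    for (int i = 0; i < 5; ++i)
        for (int j = 0; j < 5; ++j)
            n.push_back(static_cast<float>(i + j + 2));
    return n;
}
```

With this model, emulate_tex1d_linear(make_flattened_n(), 1.0f) blends texels 0 and 1 equally and gives 2.5, while a coordinate of 1.5f lands on the center of texel 1 and gives 3. Integer coordinates always fall halfway between two texels, which matches the half-step values reported in the question.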

rachit · February 25, 2010, 4:24pm

hey thanks fcs

setting the filter mode to cudaFilterModePoint solved the problem…
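
For comparison, a minimal host-side sketch of why point mode returns the stored values: with unnormalized coordinates a point-filtered fetch simply picks the texel at floor(x), so the integer index computed in the kernel maps straight onto one element. The names emulate_tex1d_point and make_flattened_n_rowmajor are invented for this example, and clamp addressing is assumed.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <vector>

// Hypothetical helper: emulates tex1D() with cudaFilterModePoint,
// unnormalized coordinates and clamp addressing (assumed here).
float emulate_tex1d_point(const std::vector<float>& texels, float x) {
    assert(!texels.empty());
    const int last = static_cast<int>(texels.size()) - 1;
    int i = static_cast<int>(std::floor(x));   // nearest texel at or below x
    i = std::max(0, std::min(i, last));        // clamp to [0, last]
    return texels[i];
}

// Hypothetical helper: N[i][j] = i + j + 2 from the question, flattened
// row by row so that element (row, col) lives at index row * 5 + col.
std::vector<float> make_flattened_n_rowmajor() {
    std::vector<float> n;
    for (int i = 0; i < 5; ++i)
        for (int j = 0; j < 5; ++j)
            n.push_back(static_cast<float>(i + j + 2));
    return n;
}
```

Under this model, fetching at float(1 * 5 + 2) returns N[1][2], which is 5, with no blending between neighbors.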
