why texture makes it slower?

KaiK · July 8, 2009, 9:20am

Hi!

I’m working in a code where I have to read from diferent pixels of a image.

The pixels that contiguous threads are not contiguous, but they’re usually near.

I don’t know why, my code runs faster when I put the image into global memory than when I put it into a texture.

Can anyone help me?

Here’s my kernel code (case global mem)

global void my_kernel(int3 voxelsDim, float voxel_size, int blocks_x, int blocks_y, unsigned char *p_im_data , float *vox_n_cams_dev)
{

//determine the real index of the thread in x and y
__shared__ int block_x_pos;
__shared__ int  block_y_pos;
if(threadIdx.x == 0 && threadIdx.y == 0){
	block_x_pos = blockIdx.x % blocks_x;
	block_y_pos = blockIdx.x / blocks_x;
}
__syncthreads();


int2 thread = make_int2(blockDim.x * block_x_pos + threadIdx.x, blockDim.y * block_y_pos + threadIdx.y);


//tests if thread is inside the working zone
if(thread.x < voxelsDim.x && thread.y < voxelsDim.y && blockIdx.y < voxelsDim.z){
	
	//determines the index for the 1D array.
	int index = thread.x  +  thread.y * voxelsDim.x  +  blockIdx.y * voxelsDim.x * voxelsDim.y;
	
	
        //some functions with registers that returns "unadjacent _index"
	
	
	//unadacent read, and adjacent write
	if(p_im_data[ unadjacent_index ] > 128 ){    //when texture mem, here I use:   if(tex2D(tex_image, unadj_index_x, unadj_index_y) > 128 )
                vox_n_cams_dev[index]++;
            }
		    
}

}

any explanation about what’s going wrong?

with global memory, it takes: 5.979680 ms
with texture memory, it takes: 6.449344 ms

thanks in advanced!

Enrique oriol

Topic		Replies	Views
Why are texture memory reads slower than global reads even though it is being accessed spatially? CUDA Programming and Performance cuda	0	457	June 19, 2020
Where to store a picture about 5 MB CUDA Programming and Performance	2	3151	June 17, 2009
Texture vs. Global Memory CUDA Programming and Performance	4	2018	August 6, 2009
Shared memory problem CUDA Programming and Performance	3	2259	February 8, 2008
Texture Memory in Maxwell is slower than global memory? CUDA Programming and Performance cuda	1	309	December 30, 2023
Block dim discussion 1D vs 2D CUDA Programming and Performance	8	8359	August 14, 2007
In what case, using text mem is slower than not using? CUDA Programming and Performance	3	1428	September 20, 2009
When to use textures CUDA Programming and Performance	7	8132	February 12, 2008
Texture memory fetch extremely slow CUDA Programming and Performance	13	3137	December 21, 2017
Memory performance in image processing example CUDA Programming and Performance	9	1621	March 24, 2011

why texture makes it slower?

Related topics