use thread to copy global to shared memory

mavmak · December 8, 2013, 3:43pm

I have a 512x512 image and I want to apply an effect to it. To compute the effect for a given region, I need a part of the image that is larger than the region itself. So I want to use the thread indices to copy data from global memory into shared memory, do the calculation in shared memory, and then copy back only the part of interest.

This is my kernel

__global__ void filter(unsigned char *image, unsigned char *out, int n, int m)
{
    // Global pixel coordinates of this thread
    int i = threadIdx.x + blockIdx.x * blockDim.x;
    int j = threadIdx.y + blockIdx.y * blockDim.y;
    // Linear index of this thread within its block
    int bindex = threadIdx.x + threadIdx.y * blockDim.x;
    // Linear pixel index in the image (row pitch is blockDim.x * gridDim.x)
    int index = i + j * blockDim.x * gridDim.x;

    // One 16x16 tile of 3-channel pixels
    __shared__ unsigned char shared[16][16*3];

    if (bindex < 256 && index < n*m)
    {
        shared[threadIdx.x][threadIdx.y*3+0] = image[index*3+0];
        shared[threadIdx.x][threadIdx.y*3+1] = image[index*3+1];
        shared[threadIdx.x][threadIdx.y*3+2] = image[index*3+2];
    }

    __syncthreads();

    // The calculation and the write to out are not implemented yet
}

and I launch it like this:

cudaMalloc( (void**)&dev_image, n*m*3);
cudaMalloc( (void**)&dev_out, n*m*3);
cudaMemcpy( dev_image, image, n*m*3, cudaMemcpyHostToDevice);

dim3 threads( 16, 16 );
dim3 blocks( 32, 32 );
filter<<<blocks, threads>>>(dev_image, dev_out, n, m);
cudaMemcpy( out, dev_out, n*m*3, cudaMemcpyDeviceToHost );
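The 32x32 grid above matches a 512x512 image only because 512 / 16 = 32. As a rough sketch, the grid could instead be derived from the image size with a ceiling division. The helper names blocksFor and gridFor, and the parameter names width and height, are hypothetical and not part of the code above.

```cuda
#include <cuda_runtime.h>

// Hypothetical helper: number of blocks needed to cover `extent` pixels
// when each block handles `block` pixels along that axis.
static inline unsigned int blocksFor(int extent, int block)
{
    return (unsigned int)((extent + block - 1) / block);  // ceiling division
}

// Hypothetical helper: 2D grid that covers a width x height image
// for the given 2D thread block shape.
static inline dim3 gridFor(int width, int height, dim3 threads)
{
    return dim3(blocksFor(width, (int)threads.x),
                blocksFor(height, (int)threads.y));
}
```

With this, the launch would read filter<<<gridFor(width, height, threads), threads>>>(...), and a call to cudaGetLastError() right after it would report a bad launch configuration. Note that the kernel computes index with a row pitch of blockDim.x * gridDim.x, which equals the image width only when the width is a multiple of the block width, as it is for 512 and 16.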

Does this copy the region I want or not?
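For comparison, here is a minimal sketch of the pattern described at the top: each block loads a tile that is larger than its output region, computes from shared memory, and writes back only the interior. It is an illustration under stated assumptions, not a tested solution to the question. The halo radius HALO = 1, the clamping at the image border, the box-average placeholder effect, and the names filterHalo, runFilterHalo, width and height are all assumptions that do not come from the code above.

```cuda
#include <cuda_runtime.h>

#define TILE_DIM 16   // threads per block side, as in the post
#define HALO 1        // assumed halo radius (hypothetical)
#define CHANNELS 3    // interleaved bytes per pixel, as in the post

// Hypothetical kernel: stage a (TILE_DIM + 2*HALO)^2 pixel tile in shared
// memory, then write only the TILE_DIM x TILE_DIM interior to `out`.
// Assumes it is launched with blockDim = (TILE_DIM, TILE_DIM).
__global__ void filterHalo(const unsigned char *image, unsigned char *out,
                           int width, int height)
{
    const int side = TILE_DIM + 2 * HALO;
    __shared__ unsigned char tile[TILE_DIM + 2 * HALO][(TILE_DIM + 2 * HALO) * CHANNELS];

    const int x0 = blockIdx.x * TILE_DIM - HALO;  // image coords of tile corner
    const int y0 = blockIdx.y * TILE_DIM - HALO;
    const int tid = threadIdx.x + threadIdx.y * blockDim.x;
    const int nthreads = blockDim.x * blockDim.y;

    // Cooperative load: the tile has more pixels than the block has threads,
    // so each thread strides over the tile and may load more than one pixel.
    for (int p = tid; p < side * side; p += nthreads) {
        int ty = p / side, tx = p % side;
        int gx = min(max(x0 + tx, 0), width - 1);   // clamp at image border
        int gy = min(max(y0 + ty, 0), height - 1);
        for (int c = 0; c < CHANNELS; ++c)
            tile[ty][tx * CHANNELS + c] = image[(gy * width + gx) * CHANNELS + c];
    }
    __syncthreads();  // every load must finish before neighbours are read

    const int x = blockIdx.x * TILE_DIM + threadIdx.x;
    const int y = blockIdx.y * TILE_DIM + threadIdx.y;
    if (x < width && y < height) {
        const int taps = (2 * HALO + 1) * (2 * HALO + 1);
        for (int c = 0; c < CHANNELS; ++c) {
            int sum = 0;  // placeholder effect: box average over the neighbourhood
            for (int dy = -HALO; dy <= HALO; ++dy)
                for (int dx = -HALO; dx <= HALO; ++dx)
                    sum += tile[threadIdx.y + HALO + dy][(threadIdx.x + HALO + dx) * CHANNELS + c];
            out[(y * width + x) * CHANNELS + c] = (unsigned char)(sum / taps);
        }
    }
}

// Hypothetical host wrapper: copy in, launch, copy out; returns the first error.
cudaError_t runFilterHalo(const unsigned char *image, unsigned char *out,
                          int width, int height)
{
    size_t bytes = (size_t)width * height * CHANNELS;
    unsigned char *dImage = NULL, *dOut = NULL;
    cudaError_t err = cudaMalloc((void **)&dImage, bytes);
    if (err != cudaSuccess) return err;
    err = cudaMalloc((void **)&dOut, bytes);
    if (err != cudaSuccess) { cudaFree(dImage); return err; }

    err = cudaMemcpy(dImage, image, bytes, cudaMemcpyHostToDevice);
    if (err == cudaSuccess) {
        dim3 threads(TILE_DIM, TILE_DIM);
        dim3 blocks((width + TILE_DIM - 1) / TILE_DIM,
                    (height + TILE_DIM - 1) / TILE_DIM);
        filterHalo<<<blocks, threads>>>(dImage, dOut, width, height);
        err = cudaGetLastError();
    }
    if (err == cudaSuccess)
        err = cudaMemcpy(out, dOut, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dImage);
    cudaFree(dOut);
    return err;
}
```

The main difference from the kernel in the question is that the shared tile is indexed as [row][column] and is loaded in a strided loop, so the halo pixels outside the block's own 16x16 region are fetched as well. Calling runFilterHalo(image, out, 512, 512) would correspond to the 512x512 case above.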
