Slow working

artemon · June 28, 2007, 8:09am

Hi Guys,

This function worked quickly on CUDA 0.8, than CUDA 1.0:

__global__ void BlurKernel

                          ( 

        unsigned char* input_data, 

        unsigned char* output_data, 

        int _width, int _height/*,

        int blur_len = 32,

        double blur_force = 0.2*/

        ) 

{

	int blur_len = 4;

	double blur_force = 0.2, radius = 0.0, force = 0.0;

	

    int x = blockIdx.x*blockDim.x + threadIdx.x;

    int y = blockIdx.y*blockDim.y + threadIdx.y;

	int res = 0;

    double count = 0.0;

	unsigned char* src = input_data;

    unsigned char* dst = output_data;

    

	for(int j1 = y - blur_len; j1 <= y + blur_len; ++j1 )

	{

       if( j1 >=0 && j1 < _height )

         {

         src = input_data + j1*_width;

         for( int i1 = x - blur_len; i1 <= x + blur_len; ++i1 )

  	{

  	if( i1 >= 0 && i1 < _width )

              {

              // 0 - данная точка, 1.0 - на расстоянии blur_len

              radius = sqrt (double((j1-y)*(j1-y) + (i1-x)*(i1-x)))/ (double)blur_len;

     //radius = (double(__mul24((j1-y),(j1-y)) + __mul24((j1-y),(i1-x))))/(double)blur_len;

              force = 1 - pow (radius/2, blur_force);

           res += int(*(src + i1)* force);

              count += force;

              } // end if

  	} // end for i1

         } // end if

	} // end for j1

  

	*(dst + y*_width + x) = count ? int(res/count) : *(input_data + y*_width + x);

}

paulius · June 29, 2007, 1:01am

How long does the kernel take in v1.0 and how long did it take in v0.8?

Paulius

artemon · June 29, 2007, 9:44am

0.8 - 47ms
1.0 - 63ms

Topic		Replies	Views
reg values CUDA Programming and Performance	0	1564	July 30, 2007
Kernel slow in 2.0 CUDA Programming and Performance	2	1976	December 4, 2008
blur image CUDA Programming and Performance	2	2876	June 20, 2007
kernel sample CUDA Programming and Performance	0	2313	July 13, 2007
Optimizing memory coalescence doesn't make my program faster CUDA Programming and Performance	12	494	August 6, 2021
kernel function make it slower then normal for loop CUDA Programming and Performance	3	559	August 18, 2016
Help me about coalescing my program run too slow CUDA Programming and Performance	5	2918	May 14, 2008
CUDA trouble CUDA Programming and Performance	3	984	March 19, 2013
help with first cuda program CUDA Programming and Performance	5	3885	June 24, 2009
The kernel isn't working CUDA Programming and Performance	9	1052	January 19, 2011

Slow working

Related topics