Is this Correct?

Manjunath_Gudisi · May 21, 2009, 11:28am

Hi ,

I have a kernel

#define width 2400

#define height 1800

__global__ foo(unsigned char *array)

{

 long idx = blockDim.x * blockIdx.x + threadIdx.x;

long limit = width * height;

//Some operations done here.

 if(idx < limit)

  {

	  // body here

  }

} 

This kernel is calling as:

foo<<<(width*height+511)/512, 512>>>( array );

But I get error cudaErrorLaunchFailour.

My question is :

(1) Can I use <<<(width*height+511)/512, 512>>> as grid dimensions and block dimensions respectively?

 because (width*height+511)/512 very big number.

x248 · May 21, 2009, 11:34am

try to launch with small numbers.
If no error it will be because your numbers are too big as they seems to be.

I think that: Maximum thread per block 512
Maximum blocks 65535

but see the reference manual to be sure.

Manjunath_Gudisi · May 21, 2009, 11:49am

For small numer of grid size it is working fine. :)

The same large grid size is working for onother function, but especially this function is giving the error.

Can you tell the way it should be handled?

x248 · May 21, 2009, 11:55am

I don’ t know, I can just tell that in the manual they say you will have a launch error if
cuda is not able to launch 1 block, and specially with problem of memory.

avidday · May 21, 2009, 12:27pm

Isn’t this just this [url=“http://forums.nvidia.com/index.php?showtopic=97228&view=findpost&p=542438”]http://forums.nvidia.com/index.php?showtop...st&p=542438[/url] again?

The resource limits are clearly described in the CUDA user guide, as is how to calculate them, and in your other thread it was explained how to use compiler options to get the register and shared memory consumption of a given kernel. Why not actually do a spot of reading and thinking about your problem? You might actually learn something…

Jamie_K · May 21, 2009, 2:23pm

Your grid can’t be more than 65535 in each dimension. The largest grid can be 65535*65535 = 4,294,836,225 blocks. You can turn a one-dimensional grid into a 2-dimensional grid using advice from this thread. Or you could simply use (width+511)/512 for the x and height for the y dimension of the grid. I also believe avidday’s advice is very good.

Topic		Replies	Views
block size CUDA Programming and Performance	6	5864	July 21, 2013
Launching Kernel Fail CUDA Programming and Performance	15	3411	May 28, 2014
MAximum block per grid CUDA Programming and Performance	8	5881	April 18, 2011
the maximum number of blocks and threads CUDA Programming and Performance	10	7015	September 4, 2008
help with some cuda programming CUDA Programming and Performance	9	1818	August 31, 2009
Limit on the size of data that can be processed by a kernel Newbie question CUDA Programming and Performance	2	1348	January 16, 2009
Launching 2**41 threads? CUDA Programming and Performance	1	909	May 4, 2009
Сan`t understand what grid dimension to use (cudaDeviceSynchronize error code 4) CUDA Programming and Performance	1	568	February 2, 2018
Size limitation for 1D Arrays in CUDA? CUDA Programming and Performance	9	18301	October 17, 2013
how many threads can used in one grid 5126553565535 CUDA Programming and Performance	1	1664	June 24, 2009

Is this Correct?

Related topics