blocks bigger than 512 threads I can't see the error

tatou1234 · March 2, 2009, 12:28pm

Hi,

I have a simple program with an array holding the values a[0]=0; a[1]=1; a[2]=2; … a[n]=n;

The kernel is supposed to add one to each element: a[0]=1; a[1]=2; … a[n]=n+1;

I use a single block, and the variable “size” holds the number of threads, which is also the size of the array.

If size is 512 or less, everything is correct. But if size is, for example, 560, everything is wrong: the result is identical to the original array (nothing changes).

I know that the maximum number of threads in a block is 512.

I want to know how I can see the error! I added this, but it doesn't report anything: CUT_CHECK_ERROR("Kernel execution failed");

[codebox]
#include <stdio.h>
#include <stdlib.h>
#include <cutil.h>  // CUDA SDK helper header (assumed): CUDA_SAFE_CALL, CUT_CHECK_ERROR

// Adds 1 to each element; one thread per element, single block only.
__global__ void PLUS(float* C)
{
    int i = threadIdx.x;
    C[i] = C[i] + 1.0f;
}

int main(int argc, char** argv)
{
    if (argc != 2) {
        printf("wrong number of arguments!!!!!!\n");
    } else {
        unsigned int size = 512;  // if size is bigger than 512 the array isn't changed
        unsigned int mem_size = sizeof(float) * size;

        // Host array initialised to 0, 1, 2, ...
        float* h_C = (float*) malloc(mem_size);
        for (unsigned int i = 0; i < size; i++)
        {
            h_C[i] = (float)i;
        }

        float* d_C;
        CUDA_SAFE_CALL(cudaMalloc((void**) &d_C, mem_size));
        CUDA_SAFE_CALL(cudaMemcpy(d_C, h_C, mem_size, cudaMemcpyHostToDevice));

        // One block of "size" threads
        PLUS<<<1, size>>>(d_C);
        CUT_CHECK_ERROR("Kernel execution failed");

        CUDA_SAFE_CALL(cudaMemcpy(h_C, d_C, mem_size, cudaMemcpyDeviceToHost));
        CUT_CHECK_ERROR("Memcpy execution failed");

        // Print the element whose index was given on the command line
        printf("h_C[%d]=%f\n", atoi(argv[1]), h_C[atoi(argv[1])]);

        CUDA_SAFE_CALL(cudaFree(d_C));
        free(h_C);
    }
    return 0;
}
[/codebox]

Which call do I need to see the error?

Thank you,

MisterAnderson42 · March 2, 2009, 12:44pm

CUT_CHECK_ERROR is a no-op in release builds. Just look up its definition in the header file and you will see.

Call cudaThreadSynchronize() followed by cudaGetLastError() to get any error code from a kernel launch. There is a function for converting the error code to a human-readable string, too. Just look it up in the reference manual.
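For illustration, here is a minimal sketch of that kind of check. It is not the original poster's program: it assumes a current toolkit, where cudaDeviceSynchronize() replaces the deprecated cudaThreadSynchronize(), and it uses cudaGetErrorString() to convert the error code to text. The size of 560 is taken from the question; on devices that allow more than 512 threads per block, that launch may simply succeed and nothing is printed.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Same kernel as in the question: one thread per element, single block.
__global__ void PLUS(float* C)
{
    int i = threadIdx.x;
    C[i] = C[i] + 1.0f;
}

int main()
{
    const unsigned int size = 560;  // above the 512-thread limit discussed in this thread
    float* d_C = NULL;

    cudaError_t err = cudaMalloc((void**)&d_C, size * sizeof(float));
    if (err != cudaSuccess) {
        printf("cudaMalloc failed: %s\n", cudaGetErrorString(err));
        return 1;
    }
    cudaMemset(d_C, 0, size * sizeof(float));

    PLUS<<<1, size>>>(d_C);

    // An invalid launch configuration is reported by cudaGetLastError().
    err = cudaGetLastError();
    if (err != cudaSuccess)
        printf("Kernel launch failed: %s\n", cudaGetErrorString(err));

    // Errors raised while the kernel runs are returned by the synchronize call.
    err = cudaDeviceSynchronize();
    if (err != cudaSuccess)
        printf("Kernel execution failed: %s\n", cudaGetErrorString(err));

    cudaFree(d_C);
    return 0;
}
```

Unlike CUT_CHECK_ERROR, these checks do not depend on the build type, so they also report the failure in release builds.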

tatou1234 · March 2, 2009, 1:33pm

Thank you, MisterAnderson42, but there isn’t any error code. It simply looks as if the program never runs the kernel.

Why is no error reported?
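Separately from the error reporting, an array larger than the per-block limit is normally processed by launching several blocks. The sketch below is one way to do that, not something posted in this thread: the kernel name plusOne, the extra parameter n, and the block size of 256 are assumptions chosen for illustration. It also prints the limit reported by cudaGetDeviceProperties() for device 0, so you can see what your card actually allows.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical multi-block variant of the kernel: each thread handles one
// element, and the bounds check protects the partially filled last block.
__global__ void plusOne(float* C, unsigned int n)
{
    unsigned int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        C[i] += 1.0f;
}

int main()
{
    const unsigned int size = 560;
    const size_t mem_size = size * sizeof(float);

    // Ask the device for its per-block thread limit.
    cudaDeviceProp prop;
    if (cudaGetDeviceProperties(&prop, 0) != cudaSuccess) {
        printf("no usable CUDA device\n");
        return 1;
    }
    printf("maxThreadsPerBlock = %d\n", prop.maxThreadsPerBlock);

    const unsigned int threads = 256;                            // assumed block size, below the limit
    const unsigned int blocks = (size + threads - 1) / threads;  // round up

    float* h_C = (float*)malloc(mem_size);
    for (unsigned int i = 0; i < size; i++)
        h_C[i] = (float)i;

    float* d_C = NULL;
    cudaMalloc((void**)&d_C, mem_size);
    cudaMemcpy(d_C, h_C, mem_size, cudaMemcpyHostToDevice);

    plusOne<<<blocks, threads>>>(d_C, size);
    cudaError_t err = cudaGetLastError();
    if (err != cudaSuccess)
        printf("Kernel launch failed: %s\n", cudaGetErrorString(err));

    // This blocking copy also waits for the kernel to finish.
    cudaMemcpy(h_C, d_C, mem_size, cudaMemcpyDeviceToHost);
    printf("h_C[%u] = %f\n", size - 1, h_C[size - 1]);

    cudaFree(d_C);
    free(h_C);
    return 0;
}
```

With the global index computed from blockIdx.x and blockDim.x, the array size is no longer tied to the number of threads in a single block.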
