kernel printf: strange behaviour of printf in __global__ sub

spiker · February 22, 2011, 10:49am

Hello, I am seeing strange behaviour from the cudaThreadSynchronize() function.
Here is my source code and what happens:

for (t=0;t<10;t++) {
	runneus<<<dim3(GRID / TBX, SLICES / TBY),dim3(TBX,TBY)>>>(neus,rnds,100,70,10.0);
	printf("%s\n",cudaGetErrorString(cudaGetLastError()));
	runsyns<<<dim3(GRID / CHX, SLICES),dim3(CHX ,NPRE)>>>(neus);
	printf("%s\n",cudaGetErrorString(cudaGetLastError()));
	reduces<<<dim3(GRID / CHX, SLICES),dim3(CHX ,NPRE)>>>(neus);
	printf("%s\n",cudaGetErrorString(cudaGetLastError()));
	getch();

	//cudaThreadSynchronize();
	printf("\n");
}

Run this way, cudaGetLastError() reports no error.

When I enable the cudaThreadSynchronize() call (by removing the // comment), this is the output:

no error
no error
no error

unknown error
unknown error
unknown error

Has anyone solved this error? What's happening?
Thank you…

avidday · February 22, 2011, 12:06pm

Neither kernel launches nor cudaGetLastError() are blocking calls, and cudaGetLastError() only returns the error state of the CUDA runtime at the moment it is called. What is probably happening is that the first kernel launches OK, and then the other two are queued successfully. On most sane platforms a kernel launch only takes about 10 microseconds, and I guess your execution time is considerably longer than that. The first kernel later aborts during execution, either leaving an error with the runtime or killing the context altogether. The cudaThreadSynchronize() call (which is blocking) forces the host to wait until the kernels are done, so on the next loop iteration cudaGetLastError() is called again and you get to see the error.
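To make the timing concrete, here is a minimal sketch of the effect described above. The kernel name faulty and its deliberate null-pointer write are assumptions made purely for illustration; they are not from the original code. It also uses cudaDeviceSynchronize(), which replaced the now-deprecated cudaThreadSynchronize() in later CUDA releases.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: deliberately writes through a null pointer
// so that it fails while executing, not at launch time.
__global__ void faulty(int *p)
{
    *p = 1;
}

int main()
{
    // The launch is asynchronous: it only queues the kernel.
    faulty<<<1, 1>>>(nullptr);

    // Checked right after the launch: this catches launch and
    // configuration errors only, because the kernel may not have run yet.
    printf("after launch: %s\n", cudaGetErrorString(cudaGetLastError()));

    // Blocking call: waits for the kernel to finish and returns
    // any error raised during its execution.
    printf("after sync:   %s\n", cudaGetErrorString(cudaDeviceSynchronize()));

    return 0;
}
```

On a typical setup the first line would be expected to report no error and the second to report the execution failure, although the exact error string depends on the driver and device.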

For debugging a kernel launch you should do something like this:

runneus<<<dim3(GRID / TBX, SLICES / TBY),dim3(TBX,TBY)>>>(neus,rnds,100,70,10.0);

printf("%s %d %s\n", __FILE__, __LINE__, cudaGetErrorString(cudaGetLastError()));

printf("%s %d %s\n", __FILE__, __LINE__, cudaGetErrorString(cudaThreadSynchronize()));

for every kernel call you launch. That will serialize each launch, tell you which launch is failing, and may give more information about the error. Once everything works, strip out the blocking calls.
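The per-launch checks above can be wrapped in a macro so they are easy to add and later remove. This is only a sketch: the macro name CHECK_CUDA and the kernel scale are hypothetical and not part of the original code, and cudaDeviceSynchronize() is used as the current replacement for the deprecated cudaThreadSynchronize().

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper: prints file, line and error string, then exits,
// whenever a CUDA runtime call does not return cudaSuccess.
#define CHECK_CUDA(call)                                              \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,        \
                    cudaGetErrorString(err_));                        \
            exit(EXIT_FAILURE);                                       \
        }                                                             \
    } while (0)

// Hypothetical kernel standing in for runneus, runsyns and reduces.
__global__ void scale(float *data, float factor, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        data[i] *= factor;
}

int main()
{
    const int n = 1024;
    float *d = nullptr;

    CHECK_CUDA(cudaMalloc(&d, n * sizeof(float)));
    CHECK_CUDA(cudaMemset(d, 0, n * sizeof(float)));

    scale<<<(n + 255) / 256, 256>>>(d, 2.0f, n);
    CHECK_CUDA(cudaGetLastError());       // launch/configuration errors
    CHECK_CUDA(cudaDeviceSynchronize());  // errors raised during execution

    CHECK_CUDA(cudaFree(d));
    return 0;
}
```

Placing both checks after every launch pins a failure to one kernel, at the cost of serializing the launches, so the synchronizing check is the one to remove once the code works.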
