How to check if kernel was launched? Is possible that kernel failed to launch but it was not recorde

Cygnus_X1 · March 7, 2010, 6:50pm

My question is - if the following is the correct way to check if a kernel was launched:

__global__ void suspiciousKernel(int *i) {

  *i=123;

  __syncthreads();

  [...] //some big code goes over here

}

int main() {

	cudaError err;

	int *gpuI;

	int cpuI=42;

	err=cudaMalloc( (void**)&gpuI, sizeof(int));

	printf("Allocate: %s\n",cudaGetErrorString(err));

	err=cudaMemcpy( gpuI,&cpuI,sizeof(int),cudaMemcpyHostToDevice);

	printf("Send: %s\n",cudaGetErrorString(err));

	suspiciousKernel<<<1,512>>>(gpuI);

	err=cudaThreadSynchronize();

	printf("Launch: %s\n",cudaGetErrorString(err));

	err=cudaMemcpy(&cpuI,gpuI,sizeof(int),cudaMemcpyDeviceToHost);

	printf("Receive: %s\n",cudaGetErrorString(err));

	printf("Got value %d\n",cpuI);

}

According to Programming Guide:

So I would expect that if my kernel call crashes or is not executed for whatever reason, I will get err different than cudaSuccess out from cudaThreadSynchronize.

On the other hand, if the kernel is executed, I should now have value 123 under gpuI pointer, assuming my “some big code goes over here” does not modify (or even read/depend on) the value. What I get out from the above code is:

Allocate: no error

Send: no error

Launch: no error

Receive: no error

Got value 42

So my question is - what must happen so that I have these results?

Some notes:

I launch only one block of my suspicious kernel so __syncthreads() stops all threads on whole GPU.
It could happen that I change *i accidently, but if that happens what are the odds of setting it back to the old value?

Cygnus_X1 · March 8, 2010, 8:56pm

Update: The reason the kernel didn’t launch was that it was using 33 registers and it was too much for this launch configuration.
Still, how am I to detect, at runtime, that kernel failed to launch if not the way I have shown above?

tmurray · March 8, 2010, 9:01pm

call cudaGetLastError after your kernel launch; launch errors are not sticky (and therefore cudaThreadSynchronize will return cudaSuccess) because they do not result in the context being destroyed.

Cygnus_X1 · March 8, 2010, 9:37pm

Tested it out. Got

]Launch (get last error): too many resources requested for launch

Now I am happy :) Thank you!

Thought that errors from kernell calls are catched by the next first function which may return an error (that “Note that this function may also return error codes from previous, asynchronous launches” sentence). Obviously I was mistaken.

Topic		Replies	Views
Why does my kernel launch? CUDA Programming and Performance	5	6058	February 13, 2009
kernel printf strange behaviour of printf in __global__ sub CUDA Programming and Performance	1	3966	February 22, 2011
launching fail detection CUDA Programming and Performance	1	3157	November 12, 2008
Async Kernel launch cpu seems not getting control after kernel launch CUDA Programming and Performance	7	3279	December 3, 2008
Synchronization synchronizing a n body problem. CUDA Programming and Performance	8	4395	September 22, 2009
Kernel launch error checking CUDA Programming and Performance	0	1280	April 14, 2013
Catching errors with kernel execution How to detect failure cases CUDA Programming and Performance	3	3479	July 8, 2008
Cuda KERNEL_LAUNCH_FAILED when I call the same kernel immediately after the previous call took place CUDA Programming and Performance	4	1117	December 14, 2012
"max threads exceeded" error isn't reported CUDA Programming and Performance	1	3663	October 13, 2011
cudaDeviceSynchronize not returning error of type "invalid configuration argument" CUDA Programming and Performance	2	1743	March 12, 2014

How to check if kernel was launched? Is possible that kernel failed to launch but it was not recorde

Related topics