LaunchGrid issue: failure after successful LaunchGrid.

shsanjp · May 14, 2008, 3:55am

Hi guys,

I am quite new to CUDA and there is something I am having difficulty figuring out.
I have a kernel that loads fine, all the parameters I set seem to be OK, and cuLaunchGrid returns CUDA_SUCCESS. However, the next function call returns CUDA_ERROR_LAUNCH_FAILED :(

I made sure that I was calling cuFuncSetBlockShape too, so it is not that.

I am a bit surprised that the failure is reported after the LaunchGrid call itself, since I am not doing an asynchronous call… (well, at least I would not expect it to be).

The call I make after the LaunchGrid is a cuMemcpyDtoH, but the same thing seems to happen whatever I call next (cuCtxSynchronize, for example).

As far as I am aware the kernel compiled fine and it is not even a big one (18 registers and 52 bytes of shared memory). My block size is between 128 and 256 and the grid is something like 16x400.

Is there a way to get more information about the exact reason for the failure?
Any help would be appreciated.

Updated:
Here is some information about my config, just in case it helps.
XP64, CUDA 2.0 beta, GF8800GTX + GF8600 both as display, CUDA running on the GF8800GTX.
8GB ram.

I have also removed most of the kernel code (it now just calculates an address and updates the output buffer) and it still fails in the same way.
I have also removed all texture code from both the cpp and the cu files.
I really have no idea what is going on.
Thanks.
Laurent.

AndreiB · May 14, 2008, 6:50am

It is normal that cuLaunchGrid() returns success and cuCtxSynchronize() returns the actual error code. All kernel calls are async!

As for the reason for your failure, it’s hard to tell why it’s happening. It might be a timeout issue (~5 sec), or most probably you’re trying to read or write unallocated memory (writing past the end of an array, for example).

It would be better if you could post source code here (both kernel and host code responsible for calling).

shsanjp · May 14, 2008, 6:52am

OK, it is definitely not a timeout issue since it returns well within the 5 s.

I will check the writes (only thing left in my shader).
I will try to filter the code a bit to be able to paste it.

Thanks.

shsanjp · May 14, 2008, 7:00am

Getting closer.
I have no idea why, but it seems that CUDA doesn't like my last parameter, which is a float.
If I remove it then it works.

One thing I am a bit worried about is the pointer size.
My first parameter is a pointer but it seems to be passed as 32-bit (which seems reasonable for a GPU).
Is that correct, or should it be a 64-bit pointer too?

shsanjp · May 14, 2008, 7:41am

Got it working now.
I had to reduce the number of parameters passed to my function.
I had 6 or 7; after reducing it to 5 it worked.

Is there a limit?
I must have missed it in the documentation.

Anyway now next issue…

Thanks.

AndreiB · May 14, 2008, 9:59am

No, even if there is some limit on the number of kernel parameters, it is much higher than 6 or 7…
