Resetting CUDA context after kernel failure?

Saul · September 12, 2008, 11:05pm

Hello,

When a kernel fails with an “unspecified launch failure”, every subsequent call to the card [Edit: in that same process], including memcpys, fails with the same error. How do I reset the current thread’s context so I can run further code? I’m calling cudaGetLastError and cudaGetErrorString (which is how I get the error message), but the error is never reset to cudaSuccess as the documentation suggests it should be. I’ve even tried cudaThreadExit (which itself doesn’t fail), because the documentation says that any subsequent call reinitializes the runtime, but subsequent calls continue to return the same unhelpful error code.

I’m storing debug data in global memory, which would be very useful for post-crash analysis if I could only get at it.

Does anyone know how to reset the current thread’s context using the Runtime API, so that the card may be used again?

Thanks,
Saul

[Edit: This is on 64-bit Ubuntu 8.04.]

cbuchner1 · September 13, 2008, 9:08am

I’m having the same problem on Windows XP Professional. Once I get the unspecified launch failure, I typically keep getting it until I reboot.

Christian
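
For anyone trying to reproduce the symptom described in this thread, here is a minimal sketch. It assumes a current CUDA toolkit, so it uses cudaDeviceSynchronize where the 2008-era runtime used cudaThreadSynchronize. The kernel `faulting_kernel` and the helper `report` are hypothetical names made up for illustration, and the null-pointer write is just one easy way to provoke a device-side fault. The exact error string printed may differ between toolkit versions and GPUs.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel that faults on purpose by writing through the pointer it is given.
__global__ void faulting_kernel(int *p) {
    *p = 42;
}

// Hypothetical helper: print the status of a runtime call and pass it through.
static cudaError_t report(const char *what, cudaError_t err) {
    printf("%-28s -> %s\n", what, cudaGetErrorString(err));
    return err;
}

int main() {
    int *d_debug = nullptr;
    int h_debug = 0;

    // Allocate and zero a "debug" word in global memory before anything goes wrong.
    report("cudaMalloc", cudaMalloc(&d_debug, sizeof(int)));
    report("cudaMemset", cudaMemset(d_debug, 0, sizeof(int)));

    // Launch with a null pointer. Launches are asynchronous, so the fault is
    // normally reported at the next synchronizing call, not at launch time.
    faulting_kernel<<<1, 1>>>(nullptr);
    report("cudaGetLastError (launch)", cudaGetLastError());
    report("cudaDeviceSynchronize", cudaDeviceSynchronize());

    // Query the error again, then try to read the debug word back.
    // With a device-side fault, both calls are expected to keep failing.
    report("cudaGetLastError (again)", cudaGetLastError());
    report("cudaMemcpy (device to host)",
           cudaMemcpy(&h_debug, d_debug, sizeof(int), cudaMemcpyDeviceToHost));

    return 0;
}
```

If the last two lines still print an error, that matches what Saul reports: querying the error does not clear it, and the debug data in global memory cannot be copied back from that process. As far as I know, the runtime documentation for recent toolkits describes this class of error as leaving the process in an inconsistent state that requires terminating and relaunching it, so treat any in-process recovery as unreliable.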
