IIRC memory is protected on the GPU and the addresses are virtual. A GPU segfault should never bring the system down.
I know it has happened, at least on older drivers. Is it still an issue? Does someone have a minimal repro that hangs the system?
I’ve actually just written an app in which I intentionally go beyond array bounds. Kernel invocation returns an error (caught by cudaGetLastError) and either the screen blips momentarily or it goes black for a couple of seconds to return after a while with Windows saying that the driver has stopped responding and successfully recovered. I can relaunch the application after this.
This is on Windows 7 x64. I know Windows XP couldn’t handle a driver crash gently but it seems it’s not that bad now. It’s still nasty that a kernel error can bring the driver down…