Kernel launch timeout

That is the timeout issue. The output is incorrect; it should be pi (that’s one of the reasons I use this test).

I’ve run this on many other GPUs without issues.