First, 702 launch timeout error occurs.
Are you on Windows? Then your WDDM driver’s watchdog timer has killed your kernel. You options are to increase the watchdog timeout, but this isn’t recommended since it involves editing your registry, or use a Tesla GPU with the TCC driver.
And after trying to run the code several times, the error code changes from 702 to 999.
999 is error “unknown”.
Then, 716 misaligned address error starts to occur.
These all point to some type of uninitialized memory or other memory problem.
Try running your binary under the “cuda-memcheck” utility to see if anything useful is detected. It doesn’t always help, but might give some clues.