I have a CUDA kernel, modified OpenCV 3 stereoBM https://docs.opencv.org/3.4/d9/dba/classcv_1_1StereoBM.html. It takes two large images (48 megapixels) as input.
When it runs on Windows 10 1607 machine with NVIDIA GTX 1050 Ti, the launch times out and is terminated. It appears that the launch is terminated when the code runs for more than 2 seconds. The GUI does not freeze during the execution.
However, there is no termination on Windows 10 1809 machine with NVIDIA GT 1030 and the code may run even for 11 seconds. GUI does not freeze too.
The first machine uses TdrDebugMode registry key set to 3 and the second one uses the same key with the same value plus TdrDelay set to 2.
The question is why timeout is detected only on the first machine but not on the second machine?
I run another bit of code: CUDA kernel which consist of a single infinite loop. Execution of this code is terminated only on the first machine.