I am seeing very strange behavior on a 2080Ti.
In my project, I use LLVM to generate IR, emit a PTX file through the NVPTX backend, load that PTX file with the CUDA API, and launch the kernel.
The problem is that my program runs incorrectly only on the 2080Ti; it works well on the P100 and the 1080.
To debug it, I inserted a printf call into the LLVM IR, and then something strange happened: with the printf call inserted, the program works correctly on the 2080Ti.
OS : Ubuntu 16.04
CUDA driver : 415.27
CUDA runtime : 9.0
sm arch : sm_60
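
One thing that can be checked without a GPU is the header of the generated PTX file, since the driver JIT-compiles PTX for the device that loads it. Below is a minimal sketch, not the project's actual code: `read_ptx_header`, `can_jit_on`, and the `SAMPLE_PTX` text are hypothetical names and data made up for illustration. It assumes the file starts with the usual `.version`, `.target`, and `.address_size` directives, and it uses the compute capabilities 6.0 (P100), 6.1 (1080), and 7.5 (2080Ti).

```python
import re

# Hypothetical PTX header for illustration only; the real file comes from the NVPTX backend.
SAMPLE_PTX = """\
//
// Generated by LLVM NVPTX Back-End
//

.version 6.0
.target sm_60
.address_size 64
"""


def read_ptx_header(ptx_text):
    """Return (ptx_version, target_arch, address_size); missing entries are None."""
    version = re.search(r"^\s*\.version\s+(\d+\.\d+)", ptx_text, re.MULTILINE)
    target = re.search(r"^\s*\.target\s+(sm_\d+)", ptx_text, re.MULTILINE)
    addr = re.search(r"^\s*\.address_size\s+(\d+)", ptx_text, re.MULTILINE)
    return (
        version.group(1) if version else None,
        target.group(1) if target else None,
        int(addr.group(1)) if addr else None,
    )


def can_jit_on(target_arch, device_major, device_minor):
    """PTX for sm_XY can be JIT-compiled for a device whose compute capability is >= X.Y."""
    ptx_cc = int(target_arch[len("sm_"):])
    device_cc = device_major * 10 + device_minor
    return ptx_cc <= device_cc


if __name__ == "__main__":
    version, target, addr = read_ptx_header(SAMPLE_PTX)
    print("PTX version:", version, "target:", target, "address size:", addr)
    # Compute capabilities: P100 = 6.0, 1080 = 6.1, 2080Ti = 7.5
    for name, (major, minor) in {"P100": (6, 0), "1080": (6, 1), "2080Ti": (7, 5)}.items():
        print(name, "can JIT this PTX:", can_jit_on(target, major, minor))
```

With a `.target sm_60` header, the check passes for all three cards, so the PTX is loadable on the 2080Ti in principle; the sketch only rules out a target mismatch and says nothing about why the results differ.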
Looking forward to any reply.