I’m working on a kernel that uses an infinite while(true) loop, with a break somewhere in the middle that signals when the process should stop. The number of iterations is not known before the loop starts, and would be very expensive to calculate in advance.
For some reason, nvcc generates no code at all for this loop: it does not appear in the PTX. The compiler simply throws it away. I don’t get it…
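For reference, here is a minimal sketch of the loop shape described above. It is not the actual kernel: the names count_halvings and run_count_halvings, and the halving step itself, are hypothetical stand-ins for the real per-iteration work. The sketch assumes the loop's result is written to global memory at the end, since nvcc is free to remove computations whose results are never stored anywhere observable, which may be worth checking in a case like this one.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical kernel: each thread loops until a data-dependent condition
// is met. The iteration count is not known before the loop starts.
__global__ void count_halvings(const unsigned int* in, int* out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;

    unsigned int x = in[i];
    int count = 0;

    while (true) {
        x >>= 1;              // stand-in for the real per-iteration work
        ++count;
        if (x == 0) break;    // exit condition in the middle of the loop body
        // further per-iteration work would go here
    }

    // Assumption: storing the result to global memory gives the loop an
    // observable effect, so the compiler has no grounds to discard it.
    out[i] = count;
}

// Hypothetical host helper (error checking omitted for brevity).
std::vector<int> run_count_halvings(const std::vector<unsigned int>& host_in)
{
    int n = static_cast<int>(host_in.size());
    std::vector<int> host_out(n, 0);
    if (n == 0) return host_out;

    unsigned int* dev_in = nullptr;
    int* dev_out = nullptr;
    cudaMalloc(&dev_in, n * sizeof(unsigned int));
    cudaMalloc(&dev_out, n * sizeof(int));
    cudaMemcpy(dev_in, host_in.data(), n * sizeof(unsigned int),
               cudaMemcpyHostToDevice);

    int threads = 128;
    int blocks = (n + threads - 1) / threads;
    count_halvings<<<blocks, threads>>>(dev_in, dev_out, n);

    // This blocking copy also waits for the kernel to finish.
    cudaMemcpy(host_out.data(), dev_out, n * sizeof(int),
               cudaMemcpyDeviceToHost);

    cudaFree(dev_in);
    cudaFree(dev_out);
    return host_out;
}
```

Calling run_count_halvings with a few unsigned values returns one iteration count per input. Compiling with nvcc -ptx and searching the output for a backward branch is one way to see whether the loop survived.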