I'm trying to write some code that uses dynamic parallelism. The only settings I’ve changed are: “Generate Relocatable Device Code” set to “Yes”, “Code Generation” set to “compute_35,sm_35”, and “cudadevrt.lib” added as an additional dependency. All of these were necessary just to get code with a device-side kernel launch to build.
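For context, here is a minimal sketch of the kind of device-side kernel launch those settings enable. The kernel names `parent` and `child` and the file name `dp.cu` are hypothetical and only for illustration, not taken from the actual project. The nvcc line in the comment is the rough command-line equivalent of the three Visual Studio settings above.

```cuda
// Rough command-line equivalent of the project settings described above
// (the file name dp.cu is a placeholder):
//   nvcc -gencode arch=compute_35,code=sm_35 -rdc=true dp.cu -o dp -lcudadevrt

#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical child kernel; it is launched from the device, not the host.
__global__ void child(int parentBlock)
{
    printf("child thread %d launched by parent block %d\n",
           threadIdx.x, parentBlock);
}

// Hypothetical parent kernel; one thread per block performs the
// device-side launch, which is what requires relocatable device code,
// compute capability 3.5 or higher, and linking against cudadevrt.
__global__ void parent()
{
    if (threadIdx.x == 0) {
        child<<<1, 4>>>(blockIdx.x);
    }
}

int main()
{
    parent<<<2, 32>>>();

    // The host-side synchronize waits for the parent grid and for every
    // child grid it launched.
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }
    return 0;
}
```

Without relocatable device code or the `cudadevrt` library, a device-side launch like the one in `parent` fails at compile or link time, which matches the need for all three settings.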
Unfortunately, this seems to break the Nsight debugger. Nsight now pops up the following message:
CUDA Dynamic Parallelism debugging is not supported in preemption mode. Breakpoints will be disabled
I'm not sure exactly what this means. I tried again with Aero disabled and got the same result. I was under the impression that debugging was still supposed to work with dynamic parallelism; is this not the case? Is there some other setting I need to change to get both dynamic parallelism and Nsight debugging working?