Debugging Dynamic Parallelism and preemption mode

randallr · March 22, 2013, 9:53pm

Trying to write some code that utilizes dynamic parallelism, the only settings I’ve changed has been to set “Generate Relocatable Device Code” to “Yes”, set “Code Generation” to “compute_35,sm_35”, and add “cudadevrt.lib” as an additional dependency. All these were necessary just to get code with a device-side kernel launch to build.

Unfortunately this seems to break the Nsight debugger. I now get the message (popping up from Nsight)

CUDA Dynamic Parallelism debugging is not supported in preemption mode. Breakpoints will be disabled

Not sure what exactly this means, tried with Aero disabled and same results. I was under the impression that debugging was supposed to still work with dynamic parallelism, is this not the case? Is there some other setting to change to get both dynamic parallelism and Nsight debugging to work?

seibert · March 23, 2013, 12:59pm

Based on another thread describing how debugging works, I think the problem here is that you are trying to debug the kernel on the same GPU that is running your display. This didn’t used to be possible at all, but NVIDIA found a way to allow some level of preemption of the GPU device so that it can halt your program in the debugger and still redraw the display. It sounds like this trick (since the card doesn’t really support preemption like a CPU) doesn’t yet work for dynamic parallelism.

The most direct solution I think is to get a second cheap GPU to use exclusively for your display, but hopefully someone knowledgeable about CUDA on Windows will chime in here…

Greg · March 24, 2013, 2:25am

Nsight Visual Studio Edition single GPU debugging mode (“preemption mode”) is not yet supported for applications using CUDA Dynamic Parallelism. Nsight VSE can debug CDP applications using remote debugging or local debugging where the CDP capable device is headless or configured to use the Tesla Compute Cluster (TCC) driver.

randallr · March 26, 2013, 5:03pm

Hey Greg, thanks for the response, just to clarify, can a second GPU be added to the local machine (not the same as the CUDA GPU) to drive the display and allow for debugging of dynamic parallelism?

Greg · March 26, 2013, 11:33pm

Randllr,

You an use any GPU to drive the display. The CC 3.5 device must be headless or configured to use the TCC driver.

Greg

randallr · March 27, 2013, 6:54pm

Any idea when/if debugging dynamic parallelism will be supported in “preemption mode”? The nsight page brags about debugging dynamic parallelism with absolutely no caveats.

AntonH · April 17, 2013, 4:26am

Thanks for the hint Greg,

I have the setup mentioned by randallr - I have headless Titan (no monitor hooked to it). My screen is being driven by a small AMD Radeon video card. And yet, when trying to debug a Dynamic Parallelism example I am experiencing the error.

I ensured I have the latest display driver (not the Tesla one). Do I understand correctly that I should be able to debug Dynamic Parallelism examples?

Also, when my computation is taking more than a few seconds, Windows (7) restarts my video driver. From what I understand, this is expected Windows behavior - is it correct?

Do I need to install the TCC driver to be able to debug and code against the Titan in this setup?

Regards,
Anton.

yarospo · July 28, 2013, 8:30am

@Anton

Hi Anton, I have the same setup and the same issue. Were you able to resolve yours?

Kind regards,
Yaro

vacaloca · July 28, 2013, 3:13pm

To resolve the issue of Windows 7 restarting the video driver, you need to disable TDR. See:
[url]https://devtalk.nvidia.com/default/topic/535264/cuda-programming-and-performance/kernel-runs-fine-on-osx-linux-crashes-on-windows-/post/3762516/#3762516[/url]

Make sure you’re using the latest CUDA 5.5 and the Parallel NSight that comes with it if you’re attempting to debug any code that has dynamic parallelism features also. There’s no concept of a TCC driver for anything other than a K20 or K20X, so if you have a Titan or a GK208-based NVIDIA card that supports CC 3.5, it might require to have a different (NVIDIA) GPU to drive the display if the same preemption error is still present.

I had a mix of an ATI (driving display) and NVIDIA GPU (CUDA) at one point and for a specific software at the time (Jacket) they did not play nice together, so that itself might be the issue.

Topic		Replies	Views
Nsight Visual Studio crash with multi-gpu setup and dynamic parallelism. CUDA Setup and Installation	1	1606	June 17, 2015
Dynamic Parallelism with a GeForce GTX 750 Ti CUDA Programming and Performance	4	1133	November 9, 2014
CUDA debugger does WDDM timeout at breakpoint CUDA Programming and Performance	6	1306	June 2, 2015
CUDA 5 Debugging Mode CUDA Programming and Performance	9	2253	July 1, 2012
NSight 4.7 trouble with Titan Z Nsight Visual Studio Edition	4	1913	July 22, 2015
CUDA debugging and profiling problems Nsight Visual Studio Edition	7	3543	January 19, 2017
How to profile dynamic parallelism CUPTI – CUDA Profiler Tools Interface	9	2337	November 29, 2023
Unable to debug CUDA samples in GTX1080 Nsight Visual Studio Edition	22	3905	July 21, 2017
The problem of nsight debugging in vs2015, win10. CUDA Programming and Performance	6	1391	June 16, 2017
Next-Gen debugger fails to start Nsight Visual Studio Edition	36	7105	March 16, 2018

Debugging Dynamic Parallelism and preemption mode

Related topics