Is it possible to display the CDP functions in Nsight Systems?

Is it possible to display the CDP functions (i.e. Call _global_ function from device) in Nsight Systems? Or is it possible to use Nsight Compute to display it?

If it is not possible, please let me know if it is possible to display them in Nsight Compute or other software.

When profiling the CUDA Dynamic Parallelism (CDP) example in NVIDIA Nsight Systems 2021.2.1, the GPU kernel was not displayed.
I used cdpQuadtree as the target for the profile.

I am sorry that there was no response to this earlier, your forum post was dropped in an orphaned category that the Nsys team was unaware of until this afternoon.

All of the tools that use NVIDIA’s CUPTI library for CUDA trace are unable to trace CDP functions since Volta. Unfortunately this includes both Nsight Systems and Nsight Compute.

1 Like

I apologize for posting in the wrong category.
Thank you for your answer.