Seems like found a bug, when i trying to use nsight systems to profile a cuda program, main problem code be like:
this program can work properly, but in nsight systems, there is no CUDA HW infomation, and streams, kernels too. But once i removed the return line, nsight systems works fine
my nsight systems version is 2023.3.1
Moved to Nsight System’s forum.
That’s an odd one. I’m wondering if we are getting those kernels properly from CUPTI. @mjain can you have someone check?
Does code flow always execute the cudaKernel? Is there a possibility that ‘if’ condition is true under certain conditions which are more likely to happen when you profile the code using nsight system? Does nsight systems profile the kernels before the if condition?
It would help to debug the issue if you can provide us with the minimal reproducer and nsys command line options.
Sorry, I tried to generate the minimal sample code, but it wont reproduce the problem. But in our project, its indeed occured