Profile a PyTorch model using NCU


I have built a PyTorch model and want to profile each layer or operator during inference to get metrics such as memory usage, PCIe bandwidth, and GPU utilization. How can I do that with Nsight Compute, or are there other methods available?


Please refer to the following docs.
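As a starting point, here is a minimal sketch of one common approach, assuming a CUDA build of PyTorch and a GPU. It wraps each leaf layer in an NVTX range so that NVIDIA profilers can attribute kernels to layers, and it uses torch.profiler to collect per-operator time and memory. The helper names (range_name, add_nvtx_ranges, profile_inference) are made up for illustration and are not part of any library.

```python
def range_name(qualified_name, module):
    """Build a readable label for one layer, e.g. 'features.0 (Conv2d)'."""
    return f"{qualified_name} ({type(module).__name__})"


def add_nvtx_ranges(model):
    """Wrap the forward pass of every leaf module in an NVTX push/pop range.

    Returns the hook handles; call .remove() on each to undo the instrumentation.
    """
    import torch  # imported lazily so range_name() works without PyTorch installed

    handles = []
    for name, module in model.named_modules():
        if next(module.children(), None) is not None:
            continue  # skip containers; only leaf layers get their own range

        def push(mod, args, label=range_name(name, module)):
            torch.cuda.nvtx.range_push(label)
            # Return None on purpose: a non-None value would replace the inputs.

        def pop(mod, args, output):
            torch.cuda.nvtx.range_pop()
            # Return None on purpose: a non-None value would replace the output.

        handles.append(module.register_forward_pre_hook(push))
        handles.append(module.register_forward_hook(pop))
    return handles


def profile_inference(model, example_input, steps=3):
    """Run a few inference steps under torch.profiler and return a summary table."""
    import torch
    from torch.profiler import ProfilerActivity, profile

    model.eval()
    with torch.no_grad(), profile(
        activities=[ProfilerActivity.CPU, ProfilerActivity.CUDA],
        profile_memory=True,   # record per-operator memory allocations
        record_shapes=True,    # record input shapes for each operator
    ) as prof:
        for _ in range(steps):
            model(example_input)
        torch.cuda.synchronize()  # make sure all GPU work is finished
    return prof.key_averages().table(sort_by="cuda_time_total", row_limit=20)
```

To use it, call add_nvtx_ranges(model) once before inference, then either print profile_inference(model, x) or launch the script (here assumed to be called infer.py) under a profiler, for example "ncu --nvtx -o report python infer.py" or "nsys profile --trace=cuda,nvtx -o report python infer.py". Note that Nsight Compute reports metrics per CUDA kernel rather than per layer, so timeline-level information such as host-to-device transfers is usually easier to inspect in Nsight Systems.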

If you still need further assistance, we will move this post to the Nsight-related forum.

Thank you.