How to Display Source Code Alongside SASS in Nsight Compute for Custom CUDA Kernels Called from PyTorch/Python Autograd?

mahmoud52623 · November 6, 2024, 12:51am

Hello, I would really appreciate any help or shared experiences with profiling custom CUDA kernels integrated with PyTorch's autograd!

I'm working on a project where these kernels are compiled with setup.py and exposed through PyBind11 so that they plug into PyTorch's autograd for training. The kernels are called from train.py during loss.backward(), which launches the backward pass and the gradient calculation. For compilation, I pass the -g and -lineinfo flags.

However, when I profile these kernels with Nsight Compute, I can only see the SASS and not the CUDA source code alongside it. I have also run Nsight Compute with the "--import-source yes" option.

Has anyone successfully configured Nsight Compute to display the CUDA source code alongside SASS for kernels compiled this way? Any tips for verifying that the compiled binaries include the source information Nsight Compute needs would be greatly appreciated.

My environment is Windows, CUDA 12.3, Nsight Compute 2024.3, PyTorch 2.3.1+CU12.1.
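
For context, here is a minimal sketch of how -lineinfo might be passed to nvcc in a setup.py of this kind. It assumes the extension is built with torch.utils.cpp_extension (CUDAExtension and BuildExtension); the package name my_kernels and the source file names are hypothetical placeholders, not the actual project files.

```python
# Sketch of a setup.py that forwards -lineinfo to nvcc.
# Assumption: the project builds with torch.utils.cpp_extension.
# "my_kernels", "bindings.cpp" and "kernels.cu" are hypothetical names.

def nvcc_flags(line_info=True, extra=()):
    """Return the flag list intended for nvcc (device code)."""
    flags = list(extra)
    if line_info and "-lineinfo" not in flags:
        # -lineinfo asks nvcc to emit source line information for device code.
        flags.append("-lineinfo")
    return flags


def compile_args(line_info=True):
    """Build the per-compiler dict accepted by CUDAExtension."""
    return {
        "cxx": [],                        # host compiler flags (left empty here)
        "nvcc": nvcc_flags(line_info),    # flags seen when compiling .cu files
    }


def main():
    # Imports are kept local so the helpers above can be inspected without
    # setuptools or torch being installed.
    from setuptools import setup
    from torch.utils.cpp_extension import BuildExtension, CUDAExtension

    setup(
        name="my_kernels",
        ext_modules=[
            CUDAExtension(
                name="my_kernels",
                sources=["bindings.cpp", "kernels.cu"],
                extra_compile_args=compile_args(),
            )
        ],
        cmdclass={"build_ext": BuildExtension},
    )
```

In a real setup.py, main() would be called at module level. Printing compile_args() before the build is a quick way to confirm that -lineinfo sits under the "nvcc" key rather than only among the host compiler flags.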

Thank you in advance for your help!

veraj · April 21, 2025, 9:16am

Hi, @mahmoud52623

Do you use numba (or some other Python library) to write the kernel?
Or is it implemented directly in C++ (it sounds like it)?
In the latter case, if the C++ sources are correctly built with -lineinfo, we expect the SASS view to be correlated with a C++ file rather than a Python file.
So we are a little confused about your scenario.

If possible, please share your repro and then we can see how to help.
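
For anyone putting together such a repro, the profiling invocation can be sketched as below. This is only an illustration under assumptions: it presumes ncu is on PATH and that train.py is the entry point mentioned in the question; the report name and the kernel-name filter are hypothetical. The options used (--import-source, --set, --kernel-name, -o) are standard Nsight Compute CLI options.

```python
# Sketch: assemble an Nsight Compute CLI command for a Python training script.
# Assumptions: `ncu` is on PATH; "train.py" is the script from the question;
# the report name and kernel filter are hypothetical.
import subprocess
import sys


def build_ncu_command(script, report="autograd_kernels", kernel_name=None):
    """Return the ncu argument list; nothing is executed here."""
    cmd = [
        "ncu",
        "--import-source", "yes",   # store source files in the report
        "--set", "full",            # collect the full section set
        "-o", report,               # output report name
    ]
    if kernel_name is not None:
        # Restrict profiling to kernels with this name.
        cmd += ["--kernel-name", kernel_name]
    cmd += [sys.executable, script]
    return cmd


def run_profile(script, **kwargs):
    """Run the assembled command; raises CalledProcessError on failure."""
    return subprocess.run(build_ncu_command(script, **kwargs), check=True)
```

Calling build_ncu_command("train.py", kernel_name="my_backward_kernel") and printing the result shows the exact command line, which is convenient to paste into a repro report alongside the build flags.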
