Could nsight compute really trace the OpenAI-triton code?

As the @triton.jit kernels have been compiled, can the source and SASS catch the relationship between metrics like (live regs) (stall samples)?

Hi, @pannenets.f

Sorry I don’t quite get your question. What exactly you want to know ? Which kind of relationship do you refer to ?

The source and SASS panel show the color bars along with the Python code.

e.g.

@triton.jit
def foo(
  x_ptr,
  y_ptr,
):
  cur_loc = tl.program_id(0)      # color bar 1
  xx = tl.load(x_ptr + cur_loc)   # color bar 2
  yy = xx * xx + xx + 1           # color bar 3
  tl.store(y_ptr + cur_loc, yy).  # color bar 4

Could the color bars reflect things like the register pressure of corresponding lines? As the triton is compiled before running, I wonder the accuracy of tracing.

Thanks! @veraj

Yes the Source Page in Nsight Compute will show metrics at the source and SASS instruction level.

Refer the Nsight Compute document Source page section.

Hi, @pannenets.f

Sorry for the late response. Have checked with internal dev, the data is showed to python source to the best that the line table information can provide.

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.