Question about Compute Workload Analysis on Nsight Compute
It shows Pipeline Utilization (LSU, ALU, FMA, Uniform, CBU, ADU, FP16, FP64, TEX, Tensor(DP), Tensor (FP), Tensor (INT), XU)
Is there any documentaion about describe these?
I found the description found on StackOverflow. but I want to see the description on NVIDIA document.