Hi
in the output of “nsys profile --trace=cuda”, I see that kernels with long names are truncated? How can I get mote information about the kernel name? Because of the multiple arguments, the exact kernel name is somewhere in the long template name.
4,0 331.322.797 450 736.272,0 735.556 736.933 void cutlass::Kernel<cutlass_tensorop_s1688fprop_optimized_tf32_64x256_16x4>(cutlass_tensorop_s1688…