I have not found this feature in Nsight Compute. I want to know whether and when the hardware engines in an SM, for example the Tensor Cores and CUDA cores, are active or inactive. Is this possible?
Please check whether the “System Trace” activity meets your requirement.
Why is this function disabled in my environment? :(
That is odd; it should not be disabled. Can you try the latest Nsight Compute version? NVIDIA® Nsight™ Compute 2023.3 is now available.
Application: NVIDIA Nsight Compute
Version: 2023.3.0.0 (build 33266684) (public-release)
Yes, I am using the latest version, as shown above. But I profile my kernel over an SSH connection. Is that the reason?
Yes, that is the reason. Nsight Compute does not currently support remote trace.
I read the introduction to “System Trace”, but it is not what I am looking for.
What I actually want is a timeline of the activity of all compute engines (including CUDA cores and Tensor Cores) within a single kernel. The interaction between CPU and GPU does not need to be included.
Thanks for the confirmation. I will check with our developers and get back to you soon.
Have you received any feedback from your developers?
Sorry for the late response! A new PM Sampling feature was first introduced in Nsight Compute 2023.3.0 (see https://docs.nvidia.com/nsight-compute/ProfilingGuide/index.html#pm-sampling for more details). It lets users collect metrics over time during kernel execution, and it also supports the metrics you requested in your comment from Nov 9.
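For anyone who wants to try this from the command line, here is a minimal sketch of how the `ncu` invocation could be assembled. It assumes the PM sampling section identifier is `PmSampling` (please confirm with `ncu --list-sections` on your install); `./my_app`, its arguments, and the report name `pm_sampling_report` are hypothetical placeholders, and `build_ncu_command` is just an illustrative helper, not part of Nsight Compute.

```python
import shlex


def build_ncu_command(app_argv, section="PmSampling", report="pm_sampling_report"):
    """Build an ncu argument list that collects one section and saves a report.

    Assumption: "PmSampling" is the section identifier for PM sampling;
    verify it with `ncu --list-sections` before relying on it.
    """
    # --section selects the section to collect, -o names the output report.
    return ["ncu", "--section", section, "-o", report, *app_argv]


if __name__ == "__main__":
    # "./my_app" and its arguments are placeholders for the profiled application.
    cmd = build_ncu_command(["./my_app", "--size", "1024"])
    # Print the command so it can be reviewed or pasted into a shell.
    print(shlex.join(cmd))
```

The script only prints the command; it does not launch the profiler. The saved report can then be opened in the Nsight Compute UI to look at the sampled metrics over the kernel's runtime.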