Measure number of individual instructions executed

Hello,

I would like to profile my applications on the NVIDIA Drive AGX in a way that I see how many floating point instructons, integer, etc. were exectued after passing the DCO. Is there any way to do that?

Thank you &
Regards

Dear markus.kohout,
Nvidia visual profiler can count certain events and metrics. Please check nvprof --query-events and nvprof --query-metrics to see if it helps.

Thank you for your response. This seems only to work for CUDA applications. What if I just want to profile these metrics for a CPU application?

→ number of floating point operations etc. on the CPU? Utilization of the ARM NEON units?

Dear markus.kohout,
Tegra stats can show the utilization of CPU cores and Tegra system profiler can be used to know performance metric counters. Please check https://docs.nvidia.com/drive/active/5.1.0.2L/nvvib_docs/index.html#page/DRIVE_OS_Linux_SDK_Development_Guide%2FUtilities%2Fmap_to_tools_common.html%23 for more details.