While running nvprof in Ubuntu 14.04 and recording inst_executed and inst_fp_32 metrics, I noticed that inst_fp_32 returns a much larger value than inst_executed. Isn’t inst_executed the total number of instructions including inst_fp_32 and other instructions?
If not, how do I record the total number of instructions executed in a process? My aim is to get the %age of FP32 instructions executed out of the total instructions executed in a process.
inst_executed: Avg - 1014985216
inst_fp_32: Avg - 28200517632