I am using this command to generate the metrics for 1 kernel.
nvprof --metrics all --log-file log.txt --csv --profile-api-trace none myapp.exe
I get about 120 lines of output for the performance counters. Here is one of them.
“GeForce GTX TITAN X (0)”,“vectorAdd(float const *, float const , float, int)”,1,“sm_efficiency”,“Multiprocessor Activity”,65.837030%,65.837030%,65.837030%
I can’t figure out how to also get the kernel duration in this one call to nvprof.
Can anyone help me?
–Bob