I get error 14 when I try to profile the flop_count_sp metric on my P100 GPU. I'm following the CUPTI documentation to enable the 3 events needed for flop_count_sp:
352321813 : flop_count_sp_
352321814 : flop_count_sp_
352321815 : flop_count_sp_
The error is returned when I call cuptiEventGroupSetEnable or cuptiEventGroupEnable. When I run nvprof --query-metrics, flop_count_sp is listed, so my P100 GPU supports this metric. However, the 3 events above do not show up in the output of nvprof --query-events. Does this mean my P100 does not support enabling these 3 events? I can still run nvprof --metrics flop_count_sp Application to get the FLOP count I need.
In this case, how should I write C++ code to profile flop_count_sp?