I tested the ‘callback_metric’ example from CUPTI samples directory (i.e. NVIDIA GPU Computing Toolkit\CUDA\v5.5\extras\CUPTI\sample). The metric I gave is ‘achieved_occupancy’ and the result is larger than 1, which is apparently wrong.
My objective is to port out the profile results from Nsight, but the .csv file doesn’t contain all detail numbers. So I plan to use CUPTI and write the interested metrics to files in the program. I think CUPTI is the right API to do this. Anyone knows the reason for the wrong results for the sample code?