PTX instruction statistics collector

Dear everyone,

Are there any tools on Linux that can analyze CUDA PTX code and collect statistics on PTX instructions? Specifically, I want to know how many times each PTX instruction appears in a CUDA program.
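
To make the goal concrete, here is a rough sketch of the kind of count I mean. It is only an illustration, not an existing tool: the function name `count_ptx_instructions` is made up, and it assumes the PTX is already available as plain text (for example from `nvcc -ptx`) with one statement starting per line. It counts static occurrences in the PTX text, not how often each instruction is executed at run time.

```python
import re
from collections import Counter

# Optional predicate guard (e.g. "@%p1" or "@!%p1"), then the opcode with its
# dot-separated modifiers (e.g. "ld.global.u32").
_OPCODE = re.compile(r"^(?:@!?%\w+\s+)?([a-z]\w*(?:\.[\w:]+)*)")
# A leading label such as "$L__BB0_2:" or "BB0_1:".
_LABEL = re.compile(r"^[$\w]+:\s*")


def count_ptx_instructions(ptx_text, base_only=False):
    """Count how often each opcode appears in PTX source text.

    Hypothetical helper for illustration. With base_only=True, modifiers are
    dropped, so "ld.param.u64" and "ld.global.u32" are both counted as "ld".
    """
    counts = Counter()
    in_continuation = False  # True while inside a multi-line statement (e.g. call)
    for raw in ptx_text.splitlines():
        line = raw.split("//", 1)[0].strip()  # drop trailing // comments
        if not line:
            continue
        if in_continuation:
            # Operand lines of a multi-line statement are not new instructions.
            in_continuation = not line.endswith(";")
            continue
        line = _LABEL.sub("", line)
        if not line or line.startswith((".", "{", "}", "(", ")")):
            continue  # label-only lines, directives, and block punctuation
        match = _OPCODE.match(line)
        if match:
            opcode = match.group(1)
            counts[opcode.split(".")[0] if base_only else opcode] += 1
            in_continuation = not line.endswith(";")
    return counts
```

With something like this, `count_ptx_instructions(open("kernel.ptx").read()).most_common()` would list the opcodes in descending order of frequency (`kernel.ptx` is just a placeholder file name). I am hoping an existing tool already does this more robustly.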

Thank you all!