Classifying Instructions

Hello all and thank you for your help in advance,

I have several benchmark CUDA apps which I want to break-down according to the instructions contained within them. For example, BFS is 45% ALU, 10% SFU, 20% CF, 15% LD/ST, 10% Other. Which tool/method would be applicable in generating this data?

Thank you very much.