How to Profile the Open CL Code


I am new to CUDA and open CL language.

I have installed CUDA sdk 4.0 in my system and I am using visual studio 2008 to build my open CL code. The code which I have written in open CL is working fine. I want to know the steps to profile the open CL code.

Please help me.

Thanks in advance