OpenCL profiler on NVIDIA GPU

I am working with OpenCL on NVIDIA GPU.

Is there any profiler for OpenCL on NVIDIA GPUs. I found on internet that nvvp (nvidia profiler can work with some hacks, but it does not work anymore).
Additionaly, I can get some results of the resources that my application used with the ACCOUNTNG mode of nvidia-smi, but it does not give me detailed information (I get the gpu resource usage only as percentage)
I need more specifical profiling information like exact numbers of SMs and cores, number of register per thread, memory amount etc…
Is there any way to achieve it with opencl on NVIDIA GPU?

Thanks you