How can i determine Registers per work-item and Shared Memory Per work-group? I know that is possible with CUDA, what about OpenCL?
Thanks
Use NVIDIA’s Compute Profiler, it works for both CUDA C and OpenCL applications. Parallel Nsight might also be worth a look.
Danke schön External Image -Thank you very much