Hi, i want to run some benchmarks on remote systems. for that i would like to turn on profiling, like we used to do with cuda before the visual profiler days :) …
will CUDA_PROFILE=1 work on opencl as well ? or are there other variables needed for opencl ?
OpenCL is all about run-time configuration… Profiling can be enabled with an api call. Simply search “profiling” in the Khronos OpenCL specification and you’ll get what you want:
5.9 Profiling Operations on Memory Objects and Kernels
Thanks pobelzile, i know about that, but i was looking for a way of profiling applications that i didn’t write and don’t want to modify in any way … and since i know that the visual profiler activates such a mode it’s only a question of how :)