Hi,
I’m trying to use NVIDIA’s OpenCL profiler from the command line, but I can’t seem to get the register count.
I use:
export OPENCL_PROFILE_CONFIG=config
and the config file contains regperworkitem.
When I look at the *.log files, however, I get this:
“NV_Warning: Ignoring the invalid profiler config option: regperworkitem”
so I only get the values for the default counters (method, gputime, cputime, occupancy).
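In case it helps anyone checking several logs at once, here is a minimal Python sketch that lists which config options the profiler rejected. It assumes the warning text appears in the logs exactly as quoted above and that the logs are plain text in the current directory. `rejected_options` is a hypothetical helper name of my own, not part of NVIDIA's tooling.

```python
import re
from pathlib import Path

# Warning format as quoted in the post above. Whether other driver
# versions word it identically is an assumption.
WARNING_RE = re.compile(
    r"NV_Warning: Ignoring the invalid profiler config option:\s*(\S+)"
)


def rejected_options(log_text):
    """Return the option names reported as invalid, in order, without duplicates."""
    seen = []
    for name in WARNING_RE.findall(log_text):
        if name not in seen:
            seen.append(name)
    return seen


if __name__ == "__main__":
    # Scan every *.log file in the current directory (assumed location).
    for path in sorted(Path(".").glob("*.log")):
        text = path.read_text(errors="replace")
        print(path.name, rejected_options(text))
```

An empty list for a given log means no option was reported as invalid in that file.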
Wouldn’t it be great if OpenCL used CUDA terminology? I find it confusing when reading CUDA and OpenCL stuff, as I always forget which terms mean the same thing.
Did you log this in the NVDeveloper bug report system?