I’m not able to get the instructions_issued or instructions_executed counters to work with my M2050 and with CUDA 3.2 code that has been compiled with -arch sm20 for compute capability 2.0. I’m using CentOS 5.4 and driver 260.19.29.
I’ve enabled the CUDA profiler with:
and in my config file I’ve specified
But in my profile output, I’m getting the warnings:
NV_Warning: Ignoring the invalid profiler config option: instructions_issued
NV_Warning: Ignoring the invalid profiler config option: instructions_executed
The Compute_Profiler.txt file that comes in the CUDA documentation
says that these counters are available for compute compatibility 2.0,
and I’m compiling with -arch sm20. Also, I’m using other counters (l1
and l2) that are compute 2.0 only.
Has anyone had similar problems to this? Any ideas on what is wrong
with my configuration? Or is the documentation incorrect?
Thanks for your help,