Hi,
I’m not able to get the instructions_issued or instructions_executed counters to work with my M2050 and with CUDA 3.2 code that has been compiled with -arch sm20 for compute capability 2.0. I’m using CentOS 5.4 and driver 260.19.29.
I’ve enabled the CUDA profiler with:
export COMPUTE_PROFILE=1
export COMPUTE_PROFILE_CONFIG=profile_config
and in my config file I’ve specified
instructions_issued
instructions_executed
gld_request
l1_global_load_miss
l2_subp0_read_sector_misses
But in my profile output, I’m getting the warnings:
NV_Warning: Ignoring the invalid profiler config option: instructions_issued
NV_Warning: Ignoring the invalid profiler config option: instructions_executed
The Compute_Profiler.txt file that comes in the CUDA documentation
says that these counters are available for compute compatibility 2.0,
and I’m compiling with -arch sm20. Also, I’m using other counters (l1
and l2) that are compute 2.0 only.
Has anyone had similar problems to this? Any ideas on what is wrong
with my configuration? Or is the documentation incorrect?
Thanks for your help,
Mark Howison