I have being testing a toy program on my integrated Nvidia card (compute capability 1.1), and found that the instruction throughput = 1.11475.
Does this make any sense? Thanks!
From the Visual Profiler User Guide:
“instruction throughput: Instruction throughput ratio. This is the ratio of achieved instruction rate to peak single issue instruction rate. The achieved instruction rate is calculated using the “instructions” profiler counter. The peak instruction rate is calculated based on the GPU clock 0-1 speed. In the case of instruction dual-issue coming into play, this ratio shoots up to greater than 1. This is calculated as gpu_time * clock_frequency / (instructions)”