NVIDIA announces that K20m’s DP/SP performance is 1.17/3.52 Tflops.
As we known, K20m has 13 SMX(2496 CUDA cores). These cores work at 0.705GHz.
I calculate peek performance like this:
DP performance: 2496 * 0.705 = 1.759 Tflops
SP performance: 2496 * 0.705 * 3 = 5.279 TFflops.
These results are far more than 1.17 and 3.52.
Why?
Thanks.