We observe a strange discrepancy between the basic benchmark for the Tesla C2070 on the one hand and the the Geforce GTX 285 on the other. To be specific the Tesla C2070 gives worse performance than the Geforce GTX 285 for matrixMul (from the SDK).
GTX: 226 Gflops/s
C2070: 183 Gflops/s !!
The bandwidth test also gives worse results on the C2070
Does anyone have seen similar results? any idea on what could be done to improve the performance? It appears that the $250 is way better than a $4000 card. Are we missing something?
The C2070 is a Padova system with the following specs:
2x Nehalem 4C E5530 @ 2.4 GHz
24GB @ 1333 mHz memory
Ubuntu 10.10 64bit
The GTX card is on:
Intel Xeon @2.00GHz