I’ve noticed in the FAQ that NVIDIA gave some figures about the bandwidths we should get by running the example kernel from the SDK called bandwidthTest. And it made me realize that I’d never tried before. Here is the example given by NVIDIA:
And here is what I got:
I’m kind of very worried about device-device bandwidth which is very low. Any ideas? I’ve got a Dell Precision 690 with a Dual Core Xeon 3.0 GHz and a GeForce 8800 GTX running KUbuntu Edgy (32 bits). And I did the test with CUDA 0.8.