I just read the new CUDA FAQ and was very surprised by the bandwidth test results.
Example measured numbers for a Core 2 Duo processor, ASUS P5N32-SLI motherboard with 1GB memory and a GeForce 8800 GTX are:

                   Pageable       Page-locked
Host - Device      1.7 GB/sec     3.1 GB/sec
Device - Host      1.7 GB/sec     3.1 GB/sec
Device - Device    70.7 GB/sec    70.7 GB/sec
On my office system with an 8800 GTX, the bandwidthTest --dtod result is 9.4 GB/sec
(2x Xeon 3.6 GHz, Intel E7525 chipset, 8GB, WinXP Pro).
That is a big difference from the FAQ results.
At home on 8800GTS/640, C2D E6400, ASUS P5LD2-C (i945P), 2GB:
~ 3.5 GB/sec on Linux (an unsupported Ubuntu 6.10)
~ 7 GB/sec on WinXP Home
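To put the gap in numbers, here is a quick sketch in Python. The figures are copied from this post; the dictionary labels and the `shortfall` helper are just my own shorthand, not anything from the SDK.

```python
# Device-to-device figures copied from this post, in GB/sec.
FAQ_DTOD = 70.7  # FAQ value, measured on a GeForce 8800 GTX

# Labels are my own shorthand for the three setups described above.
measured = {
    "office 8800 GTX, WinXP Pro": 9.4,
    "home 8800 GTS/640, Linux": 3.5,
    "home 8800 GTS/640, WinXP Home": 7.0,
}

def shortfall(faq_value, measured_value):
    """Return how many times larger the FAQ figure is than a measured one."""
    return faq_value / measured_value

ratios = {name: shortfall(FAQ_DTOD, value) for name, value in measured.items()}

for name, ratio in ratios.items():
    print(f"{name}: FAQ figure is {ratio:.1f}x higher")
```

Only the office number is a like-for-like comparison, since the FAQ figure is for a GTX and my home card is a GTS.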
So maybe Mark Harris used a newer CUDA toolkit/SDK with many improvements?
How about the release candidate of CUDA 1.0?