For quite some time now I sensed that the data transfers from host to device and back are way too slow and that the bottleneck is really problematic for me.
I’m using GeForce 8600 GT on HP Compaq 6100 MT, in a PCI Express slot.
If I understand it correctly then the max speed of the PCI Express (which the GPU should be using to the fullest) is 4GB/s.
I’m taking this bit of information from here:
However, when I run bandwidthTest.exe from the SDK I get host-to-device bandwidth of ~600-650 MB/s and device to host of ~750-800 MB/s. These transfer rates seem compatible with the results I’ve measured in other CUDA code I wrote in the past.
My question is, is this normal? These speeds are almost 1/8 of the max potential (if I understand the max potential correctly). Is this a faulty card or motherboard?