Slow Paged Memory Transfer with M2090

I have a problem where paged memory transfers with the Tesla M2090 are very slow:

[font=“Courier New”] Device 0: Tesla M2090
Quick Mode

Host to Device Bandwidth, 1 Device(s), Paged memory
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 1832.4

Device to Host Bandwidth, 1 Device(s), Paged memory
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 1597.6
[/font]

But with pinned memory:

[font=“Courier New”] Device 0: Tesla M2090
Quick Mode

Host to Device Bandwidth, 1 Device(s), Pinned memory
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 5735.8

Device to Host Bandwidth, 1 Device(s), Pinned memory
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 5534.9
[/font]

Any ideas what could be causing this or if there is any way to speed up paged memory? I’m running Centos 6.2 and the NVidia driver 295.49. I have 36GB of RAM and 2 CPUs (Xeon E5630). The system is a Supermicro 1026GT-TF-FM209:
http://www.supermicro.com/products/system/1U/1026/SYS-1026GT-TF.cfm?GPU=FM209

The odd thing is that even the the M2090 is “top of the line”, the paged memory transfers are slower than EVERY other CPU board I have (Tesla C1060, GTX 280M, GTX 460, GTX 480, GTX 580). This is frustrating because we bought 6 systems, each with 2 M2090s, and all seem to suffer from the same slow paged memory transfers.

Thanks!