Hi,
I just set up an S870 on CentOS 5.0 with the 177.67 drivers and CUDA 2.0. It works fine, but I’m getting poor device-to-device bandwidth results. bandwidthTest from the SDK reports the following:
Running on…
      device 0:Tesla C870
Quick Mode
Host to Device Bandwidth for Pageable memory
.
Transfer Size (Bytes)   Bandwidth(MB/s)
 33554432               1988.4

Quick Mode
Device to Host Bandwidth for Pageable memory
.
Transfer Size (Bytes)   Bandwidth(MB/s)
 33554432               1739.2

Quick Mode
Device to Device Bandwidth
.
Transfer Size (Bytes)   Bandwidth(MB/s)
 33554432               31036.3
In contrast, the same test on the G80 in my desktop, with the same OS, drivers and CUDA version, gives 65 GB/s device-to-device. Other people seem to be having the same problem: http://forums.nvidia.com/index.php?showtopic=75817&hl=s870
The overall performance of my application on the S870 is about 60% of what I get on the G80 in my desktop, but the results are still correct.
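To put the two device-to-device figures on the same scale, here is a rough Python sketch of the arithmetic. The numbers are the ones quoted above. The conversion factor of 1000 MB per GB is an assumption, since the 65 GB/s figure may have been rounded or may use binary units, and the names `mb_to_gb` and `bandwidth_ratio` are just illustrative.

```python
# Figures copied from the post above; the unit convention is an assumption.
MB_PER_GB = 1000.0  # assumption: decimal units (use 1024.0 for binary units)

s870_d2d_mb_s = 31036.3    # bandwidthTest device-to-device result on the S870
desktop_d2d_gb_s = 65.0    # device-to-device figure quoted for the desktop G80
app_perf_fraction = 0.60   # application speed on the S870 relative to the desktop


def mb_to_gb(mb_per_s, mb_per_gb=MB_PER_GB):
    """Convert a bandwidth in MB/s to GB/s using the chosen unit convention."""
    return mb_per_s / mb_per_gb


def bandwidth_ratio(slow_gb_s, fast_gb_s):
    """Return the slower bandwidth as a fraction of the faster one."""
    return slow_gb_s / fast_gb_s


s870_d2d_gb_s = mb_to_gb(s870_d2d_mb_s)
ratio = bandwidth_ratio(s870_d2d_gb_s, desktop_d2d_gb_s)

print(f"S870 device-to-device: {s870_d2d_gb_s:.1f} GB/s")
print(f"Fraction of desktop bandwidth: {ratio:.0%}")
print(f"Fraction of desktop application speed: {app_perf_fraction:.0%}")
```

Under these assumptions the S870 reaches a bit less than half of the desktop's device-to-device bandwidth, which is a larger gap than the one I see in the application itself.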
I’ve attached the output from nvidia-bug-report as well. Does anyone have any ideas as to what might be going wrong?
nvidia_bug_report.log.gz (22 KB)