Write-Combining memory can slow down your application?

iceberg · January 14, 2010, 6:49am

In NVIDIA_CUDA_ProgrammingGuide_2.3.pdf, it says that " write-combining memory is not snooped during transfers across the PCI Express bus, which can improve transfer performance by up to 40%."

But when I run the bandwidthTest.exe , it turns out that the write-combining memory is no help in improving transfer performance. The host to device bandwidths for pinned memory are totally same.
( 1st test: “bandwidthTest -memory=pinned -wc”, 2nd test: “bandwidthTest -memory=pinned”)

And the worse thing is that the bandwidth will get slower if you use write-combining memory when copying data from host pageable memory to host write-combining memory. So when you use write-combining memory, the whole application performance is degraded.

Is this result reasonable?

Keldor314 · January 14, 2010, 8:09am

Well, when it’s write combined, it’s not cached on the CPU, so it makes sense that the host side bandwidth to the buffer in question would be reduced. Thus, it makes the PCIe transfer faster at the expense of CPU access time.

iceberg · January 14, 2010, 9:21am

But according to my test, the PCIe transfer speed does not change whether the host memory is write combined or not. When should we use write-combining memory ? By the test result, it seems useless.

tmurray · January 14, 2010, 9:26am

Depends on the chipset.

iceberg · January 14, 2010, 9:38am

Thank you , tmurray!

What kind of chipset will unleash this potential ? Mine is Intel 5520 chipset.

iceberg · January 15, 2010, 2:34am

Which chipset does it depend on , GPU chipset , main board chipset or both?

Topic		Replies	Views
Write Combined Memory How it enhences performance? CUDA Programming and Performance	2	15528	March 24, 2010
About Data transfer speed between CPU and GPU? How to increase the data transfer speed? CUDA Programming and Performance	7	15530	December 11, 2009
Improving data transfer performance from host to device CUDA Programming and Performance	2	2060	January 28, 2015
Bandwidth problem ? Could anyone verify that this is normal? CUDA Programming and Performance	7	3579	April 25, 2008
How to transfer massive data efficiently? CUDA Programming and Performance	5	5814	April 16, 2015
Why i can't use my full PCI-Express bandwidth? CUDA Programming and Performance	7	5043	December 17, 2020
Pinned and Pageable memory CUDA Programming and Performance	5	2419	January 16, 2020
bandwidthtest: pageable vs pinned memory CUDA Programming and Performance	4	1658	February 18, 2010
bandwidthTest anomaly! CUDA Programming and Performance	4	10877	July 31, 2009
question about page locked memory CUDA Programming and Performance	2	8763	April 21, 2009

Write-Combining memory can slow down your application?

Related topics