Memory copy speed

darot · March 30, 2009, 3:48am

I met some problem of memory copy speed. But I don’t know why?
I have two pcs whose setup are as:
PC1(workstation):
CPU:Xeon 3.4G*2
VGA:GTX 285(compute capability 1.3)
CUDA version:2.2Beta
Bandwidth calculated by cuda example
Host to Device:920MB/Sec
Device to Host:872MB/Sec

PC2(IPC):
CPU:Core2 2.4G Q core
VGA:9800GTX(compute capability 1.1)
CUDA version:2.0
Bandwidth calculated by cuda example
Host to Device:1766MB/Sec
Device to Host:1433MB/Sec

My code is to copy a 81925000 unsigned short data from host to device.
and then copy 2 81925000 char data from device to host.

the transfter time is very strange:
PC1:63ms
PC2:74ms

Why?I think the copy time depend on bandwidth, but it seems not to be like what I said.
Why?any thing I missed?

navier-stokes · March 31, 2009, 7:06am

Hi,
you are transfering very small packages of 40MB*sizeof(unsigned short). May be time measurement is not prcise enough due to coarse time steps and latency effects of the memory transfer.

YDD · March 31, 2009, 5:37pm

Also, unless you’re using pinned memory (doubtful given the figures from the CUDA SDK bandwidthTest), you’re also timing the CPU’s memory subsystem, since the CUDA runtime has to copy data into its own pinned memory buffers prior to the PCIe transfer. FWIW, on my machine, the latency of a PCIe transfer from pinned memory is about 10 microseconds.

Pimbolie1979 · April 2, 2009, 7:31am

My PC is a:
CPU = Core i7-920 (Quadcore with 2.8GHz)
Memory = Tripple Channcel DDR3-1600MHz
VGA-Card =NVIDIA 9800GTX+ with 512MB Memory

I use Windows XP and Cuda 2.1

I had test the bandwith with the CUDA bandwithexample (“NVIDIA Corporation\NVIDIA CUDA SDK\bin\win32\Release\bandwidthTest.exe”)

My Results are:
5200MByte/second from PC to GPU
4700MByte/second from GPU to PC

Now I will test a NVIDIA 285GTX.

Can anybodypost your results?

Topic		Replies	Views
CudaMemcpy() speed/bandwidth For host to device CUDA Programming and Performance	5	10028	June 30, 2009
Bad PCIe transfer performance (cudaMemcpy), what can cause that? CUDA Programming and Performance	10	11610	September 20, 2010
device to device bandwidth confusion? CUDA Programming and Performance	4	2320	February 26, 2009
Bandwidth is too slow so cudaMemcpy() takes too long. CUDA Programming and Performance	15	7560	December 12, 2012
cudaMemcpyDeviceToHost time procces CUDA Programming and Performance	6	3047	August 1, 2008
About Data transfer speed between CPU and GPU? How to increase the data transfer speed? CUDA Programming and Performance	7	15580	December 11, 2009
What factors effect GPU transfer speed? CUDA Programming and Performance	7	9179	September 15, 2009
[solved] strange cuda memcopy time CUDA Programming and Performance	5	745	March 26, 2015
Bandwidth problem ? Could anyone verify that this is normal? CUDA Programming and Performance	7	3618	April 25, 2008
The speed of data transfer between GPU and CPU CUDA Programming and Performance	4	2693	April 27, 2009

Memory copy speed

Related topics