variance in speed from host-device and device-host

sidzonline85 · March 4, 2010, 8:54am

When I transfer an array of size X from host to device, the transfer is faster than the device-to-host transfer of the same array. Why is this so?
I am using plain cudaMemcpy only, with no pinned host memory and no async copies.

seibert · March 4, 2010, 4:09pm

How large is the difference? Some difference is normal on various motherboards, but if the difference is large, you might have a problem. (I have seen 20% differences between host-to-device and device-to-host. Never did figure out why, though.)
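
To put a number on the difference, something like the following could be used. This is only a minimal sketch, not a reference benchmark: it assumes a single CUDA device, pageable host memory from `malloc` (as in the original question), and an arbitrary 64 MiB buffer copied 10 times per direction. The helper names `bandwidthGBps` and `timeCopy` are made up for this example.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cstring>
#include <cuda_runtime.h>

// Convert a byte count and an elapsed time in milliseconds to GB/s (1 GB = 1e9 bytes).
static double bandwidthGBps(size_t bytes, float ms) {
    return (static_cast<double>(bytes) / 1.0e9) / (static_cast<double>(ms) / 1.0e3);
}

// Time `reps` back-to-back cudaMemcpy calls with CUDA events.
// Returns the average time per copy in milliseconds.
static float timeCopy(void* dst, const void* src, size_t bytes,
                      cudaMemcpyKind kind, int reps) {
    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    cudaMemcpy(dst, src, bytes, kind);  // warm-up copy, not timed

    cudaEventRecord(start, 0);
    for (int i = 0; i < reps; ++i) {
        cudaMemcpy(dst, src, bytes, kind);
    }
    cudaEventRecord(stop, 0);
    cudaEventSynchronize(stop);

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);
    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    return ms / reps;
}

int main() {
    const size_t bytes = 64u << 20;  // 64 MiB; arbitrary size chosen for this sketch
    const int reps = 10;             // arbitrary repeat count

    void* h = malloc(bytes);         // pageable host memory, i.e. no pinning
    void* d = nullptr;
    if (h == nullptr || cudaMalloc(&d, bytes) != cudaSuccess) {
        fprintf(stderr, "allocation failed\n");
        free(h);
        return 1;
    }
    memset(h, 0, bytes);             // touch the host pages before timing

    float h2d = timeCopy(d, h, bytes, cudaMemcpyHostToDevice, reps);
    float d2h = timeCopy(h, d, bytes, cudaMemcpyDeviceToHost, reps);

    printf("H2D: %.3f ms per copy, %.2f GB/s\n", h2d, bandwidthGBps(bytes, h2d));
    printf("D2H: %.3f ms per copy, %.2f GB/s\n", d2h, bandwidthGBps(bytes, d2h));

    cudaFree(d);
    free(h);
    return 0;
}
```

Running it a few times with different sizes shows whether the gap is stable or just noise. Replacing `malloc`/`free` with `cudaMallocHost`/`cudaFreeHost` gives the pinned-memory numbers for comparison.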
