Does cudaMemcpyAsync require pinned memory?

jiazhe · November 25, 2015, 2:19am

Hello! Have some questions about pinned memory.

1. Is pinned memory necessary when we want to perform an asynchronous memory copy?

2. If the answer is yes, is there any size limit on pinned memory?
For example, on a machine with 64GB of host memory, will pinning 4GB significantly affect CPU performance?

Thanks!

CudaaduC · November 25, 2015, 2:35am

Yes, I believe so, according to this page:

http://devblogs.nvidia.com/parallelforall/how-overlap-data-transfers-cuda-cc/

“The host memory involved in the data transfer must be pinned memory.”

I have always used pinned memory with cudaMemcpyAsync and do see overlapping behavior.

Using 4GB out of 64GB of host memory will not degrade CPU performance. There is some additional overhead related to the initial pinned memory allocation (more than a regular host malloc).

Robert_Crovella · November 25, 2015, 2:37am

Yes (and no). If you want truly asynchronous behavior (e.g. overlap of copy and compute) then the memory must be pinned. If it is not pinned, there won’t be any runtime errors, but the copy will not be asynchronous - it will be performed like an ordinary cudaMemcpy.
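A minimal sketch of the pinned case, for readers who want to see what this looks like in code. It assumes a single CUDA-capable device; the kernel `scale`, the function name `run_pinned_async_demo`, the buffer size and the four-stream chunking are all hypothetical choices for illustration, not something taken from the replies above. The host buffer comes from `cudaMallocHost`, so the `cudaMemcpyAsync` calls in different streams are eligible to overlap with kernel execution. Replacing `cudaMallocHost`/`cudaFreeHost` with `malloc`/`free` should still give correct results, just without the overlap.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: only there to give the GPU work to overlap with copies.
__global__ void scale(float *data, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

// Splits a pinned host buffer into chunks, and for each chunk issues
// copy-in, kernel, copy-out on its own stream.
// Returns the number of wrong elements, or -1 on a CUDA error.
int run_pinned_async_demo() {
    const int n = 1 << 20;
    const int nStreams = 4;
    const int chunk = n / nStreams;
    const size_t chunkBytes = chunk * sizeof(float);

    float *h = nullptr, *d = nullptr;
    // Pinned (page-locked) host allocation; plain malloc would give pageable memory.
    if (cudaMallocHost(&h, n * sizeof(float)) != cudaSuccess) return -1;
    if (cudaMalloc(&d, n * sizeof(float)) != cudaSuccess) { cudaFreeHost(h); return -1; }
    for (int i = 0; i < n; ++i) h[i] = 1.0f;

    cudaStream_t streams[nStreams];
    for (int s = 0; s < nStreams; ++s) cudaStreamCreate(&streams[s]);

    for (int s = 0; s < nStreams; ++s) {
        int off = s * chunk;
        cudaMemcpyAsync(d + off, h + off, chunkBytes, cudaMemcpyHostToDevice, streams[s]);
        scale<<<(chunk + 255) / 256, 256, 0, streams[s]>>>(d + off, chunk, 2.0f);
        cudaMemcpyAsync(h + off, d + off, chunkBytes, cudaMemcpyDeviceToHost, streams[s]);
    }

    // Wait for all streams; this also surfaces any asynchronous error.
    cudaError_t err = cudaDeviceSynchronize();

    int mismatches = 0;
    for (int i = 0; i < n; ++i) if (h[i] != 2.0f) ++mismatches;

    for (int s = 0; s < nStreams; ++s) cudaStreamDestroy(streams[s]);
    cudaFree(d);
    cudaFreeHost(h);
    return err == cudaSuccess ? mismatches : -1;
}

int main() {
    int result = run_pinned_async_demo();
    std::printf("result: %d\n", result);
    return result == 0 ? 0 : 1;
}
```

Whether the copies and kernels actually overlap depends on the device's copy engines and is best checked in a profiler timeline rather than inferred from the code.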

The usable size may vary by system and OS. Pinning 4GB of memory on a 64GB system on Linux should not have a significant effect on CPU performance, after the pinning operation is complete. Attempting to pin 60GB on the other hand might cause significant system responsiveness issues. YMMV.
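Since the usable amount varies by system, one way to find out what a given machine tolerates is simply to try the allocation and check the return code. The sketch below does that; the helper name `try_pinned_alloc` and the 64 MiB default are hypothetical, and the timing is only a rough indication of the pinning overhead mentioned earlier in the thread.

```cuda
#include <chrono>
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper: tries to allocate `bytes` of pinned host memory,
// prints how long the allocation took, then frees it.
// Returns true if the allocation succeeded.
bool try_pinned_alloc(size_t bytes) {
    // Force context creation first so it is not counted in the timing.
    cudaFree(0);

    void *p = nullptr;
    auto t0 = std::chrono::steady_clock::now();
    cudaError_t err = cudaMallocHost(&p, bytes);
    auto t1 = std::chrono::steady_clock::now();

    if (err != cudaSuccess) {
        std::printf("pinning %zu bytes failed: %s\n", bytes, cudaGetErrorString(err));
        cudaGetLastError();  // reset the error state for later calls
        return false;
    }

    double ms = std::chrono::duration<double, std::milli>(t1 - t0).count();
    std::printf("pinned %zu bytes in %.2f ms\n", bytes, ms);
    cudaFreeHost(p);
    return true;
}

int main(int argc, char **argv) {
    // Size in MiB from the command line, defaulting to 64 MiB.
    size_t mib = (argc > 1) ? std::strtoull(argv[1], nullptr, 10) : 64;
    return try_pinned_alloc(mib << 20) ? 0 : 1;
}
```

Running it with, say, `4096` as the argument would attempt the 4GB case from the question; a failure here is reported through the return code rather than a crash.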
