I am a little puzzled about the reason why memory transfers are faster when using pinned memory.
As far as I understand, using pinned memory allows the use of DMA when transferring data from host to device memory, since DMA operates only on physical addresses, not on virtual addresses. With pageable memory (malloc), on the other hand, this is not the case, so ordinary move instructions are used for the transfer, which makes it slower. Is this right?
The GPU must always DMA from pinned memory. If you use malloc() for your host data, it sits in pageable (non-pinned) memory. When you call cudaMemcpy(), the CUDA driver first has to memcpy the data from your non-pinned pointer into an internal pinned staging buffer, and only then can it kick off the host->GPU DMA.
If you allocate your host memory with cudaMallocHost() and initialize the data there directly, the driver doesn't need that extra pageable-to-pinned memcpy before DMAing; it can DMA straight from your buffer.
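
For concreteness, here is a minimal sketch of the two paths side by side, timed with CUDA events. The 64 MB buffer size and the variable names are mine, and error checking is omitted for brevity:

    #include <cuda_runtime.h>
    #include <stdio.h>
    #include <stdlib.h>
    #include <string.h>

    int main(void)
    {
        const size_t bytes = 64 * 1024 * 1024;        /* 64 MB test buffer */
        float *d_buf, *h_pageable, *h_pinned;

        cudaMalloc((void**)&d_buf, bytes);
        h_pageable = (float*)malloc(bytes);           /* pageable: driver must stage */
        cudaMallocHost((void**)&h_pinned, bytes);     /* page-locked: direct DMA */
        memset(h_pageable, 1, bytes);                 /* touch the pages */
        memset(h_pinned, 1, bytes);

        cudaEvent_t start, stop;
        float ms;
        cudaEventCreate(&start);
        cudaEventCreate(&stop);

        /* Pageable path: the driver memcpys into an internal pinned
           staging buffer, then DMAs from there. */
        cudaEventRecord(start, 0);
        cudaMemcpy(d_buf, h_pageable, bytes, cudaMemcpyHostToDevice);
        cudaEventRecord(stop, 0);
        cudaEventSynchronize(stop);
        cudaEventElapsedTime(&ms, start, stop);
        printf("pageable H->D: %6.2f ms\n", ms);

        /* Pinned path: the GPU DMAs straight out of h_pinned. */
        cudaEventRecord(start, 0);
        cudaMemcpy(d_buf, h_pinned, bytes, cudaMemcpyHostToDevice);
        cudaEventRecord(stop, 0);
        cudaEventSynchronize(stop);
        cudaEventElapsedTime(&ms, start, stop);
        printf("pinned   H->D: %6.2f ms\n", ms);

        cudaEventDestroy(start);
        cudaEventDestroy(stop);
        cudaFreeHost(h_pinned);
        free(h_pageable);
        cudaFree(d_buf);
        return 0;
    }

On most systems the pinned copy reports noticeably higher bandwidth, precisely because the staging memcpy disappears.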
Some network (and disk) adapters support scatter/gather DMA, which lets the host hand the device a list of physical page addresses, so it can DMA to and from a buffer that is contiguous only in virtual memory (of course, this still won't work if the pages are swapped out to disk).
But since the G80 (afaik) doesn't support this, pinning really is necessary.
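
As an aside, and beyond what the G80-era toolkits offered: CUDA 4.0 and later can also page-lock an existing malloc'd buffer in place with cudaHostRegister(), so you get direct DMA without allocating through cudaMallocHost(). A sketch, assuming a CUDA 4.0+ runtime and reusing the names from the example above:

    float *h_data = (float*)malloc(bytes);
    /* Page-lock the existing allocation in place (CUDA 4.0+). */
    cudaHostRegister(h_data, bytes, cudaHostRegisterDefault);
    cudaMemcpy(d_buf, h_data, bytes, cudaMemcpyHostToDevice); /* now DMAs directly */
    cudaHostUnregister(h_data);
    free(h_data);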