Zero Copy vs Pinned Memory Performance: Need Some Explanation

mdotali · November 24, 2016, 7:18am

I have a lens correction kernel using OpenCV (Cv4Tegra) running on an NVIDIA TX1.

I have tested the kernel using two of the memory models.

1 - Pinned memory allocated using GpuMat: upload the data to it, process it, then download the result.

2 - Zero-copy mapped memory: no upload and no download, just processing.
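
For readers who want to reproduce the comparison outside OpenCV, the two approaches can be sketched with the plain CUDA runtime API. This is only a minimal sketch under assumptions, not the code from the post: the `invert` kernel is a hypothetical stand-in for the lens correction kernel, the frame size is assumed, and approach 1 is modeled as a pinned host staging buffer plus a `cudaMalloc` device buffer. The timings it prints depend on the device and workload.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cstring>
#include <cuda_runtime.h>

// Abort with a message if a CUDA runtime call fails.
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            std::fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,   \
                         cudaGetErrorString(err_));                   \
            std::exit(EXIT_FAILURE);                                  \
        }                                                             \
    } while (0)

// Hypothetical stand-in for the lens correction kernel: inverts each pixel.
__global__ void invert(const unsigned char* src, unsigned char* dst, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) dst[i] = 255 - src[i];
}

int main() {
    const int n = 1920 * 1080;  // assumed frame size, one byte per pixel
    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;
    float ms = 0.0f;

    // Request mapped host memory support before any other runtime call.
    CHECK(cudaSetDeviceFlags(cudaDeviceMapHost));

    cudaEvent_t start, stop;
    CHECK(cudaEventCreate(&start));
    CHECK(cudaEventCreate(&stop));

    // Approach 1: pinned host buffers, explicit copies to device buffers.
    unsigned char *h_src, *h_dst, *d_src, *d_dst;
    CHECK(cudaMallocHost((void**)&h_src, n));
    CHECK(cudaMallocHost((void**)&h_dst, n));
    CHECK(cudaMalloc((void**)&d_src, n));
    CHECK(cudaMalloc((void**)&d_dst, n));
    for (int i = 0; i < n; ++i) h_src[i] = (unsigned char)(i % 256);

    CHECK(cudaEventRecord(start));
    CHECK(cudaMemcpy(d_src, h_src, n, cudaMemcpyHostToDevice));  // "upload"
    invert<<<blocks, threads>>>(d_src, d_dst, n);
    CHECK(cudaGetLastError());
    CHECK(cudaMemcpy(h_dst, d_dst, n, cudaMemcpyDeviceToHost));  // "download"
    CHECK(cudaEventRecord(stop));
    CHECK(cudaEventSynchronize(stop));
    CHECK(cudaEventElapsedTime(&ms, start, stop));
    std::printf("copy + kernel + copy: %.3f ms\n", ms);

    // Approach 2: zero-copy mapped host buffers, no explicit copies.
    unsigned char *z_src, *z_dst, *zd_src, *zd_dst;
    CHECK(cudaHostAlloc((void**)&z_src, n, cudaHostAllocMapped));
    CHECK(cudaHostAlloc((void**)&z_dst, n, cudaHostAllocMapped));
    CHECK(cudaHostGetDevicePointer((void**)&zd_src, z_src, 0));
    CHECK(cudaHostGetDevicePointer((void**)&zd_dst, z_dst, 0));
    std::memcpy(z_src, h_src, n);  // same input data as approach 1

    CHECK(cudaEventRecord(start));
    invert<<<blocks, threads>>>(zd_src, zd_dst, n);
    CHECK(cudaGetLastError());
    CHECK(cudaEventRecord(stop));
    CHECK(cudaEventSynchronize(stop));  // kernel done, z_dst is now readable
    CHECK(cudaEventElapsedTime(&ms, start, stop));
    std::printf("zero-copy kernel:     %.3f ms\n", ms);

    // Both approaches should produce identical output.
    std::printf("results match: %s\n",
                std::memcmp(h_dst, z_dst, n) == 0 ? "yes" : "no");

    CHECK(cudaFreeHost(h_src)); CHECK(cudaFreeHost(h_dst));
    CHECK(cudaFreeHost(z_src)); CHECK(cudaFreeHost(z_dst));
    CHECK(cudaFree(d_src));     CHECK(cudaFree(d_dst));
    CHECK(cudaEventDestroy(start));
    CHECK(cudaEventDestroy(stop));
    return 0;
}
```

Assuming the file is saved as `compare.cu` (a hypothetical name), it can be built with `nvcc compare.cu -o compare`. The first timed region includes one-time launch overhead, so running each approach several times and discarding the first measurement gives a fairer comparison.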

Since the TX1 has an integrated GPU that shares the same memory space as the host, I shouldn’t have to “upload” to device memory before processing. If I understand it correctly, there is no device memory per se.

I ran my tests, and approach 1 is twice as fast as approach 2, even with the uploading and downloading.

So when we “upload” to a GpuMat, what exactly is happening? Why is this faster?

Similarly, why is processing on zero-copy data slower?

What does “read once, write once” mean? Is it with respect to the whole matrix, or does it refer to indexing, e.g. read index 0 only once and do not go back to index 0 again?

I have already gone through the documentation, but I haven’t been able to figure out why there is a performance loss instead of a gain.
