Why does it take longer for a program to use Unified Memory than not to use Uuified Memoery?

zzy3797777861 · January 19, 2021, 6:52pm

Code address：GitHub - zhuzhuoyue/cuda_benchmarks

use Unified Memory:(simpleManaged.cu)
./simpleManaged 400000000

host: MallocManaged: 1.082769
host: init arrays: 3.432402
device: uvm+compute+synchronize: 0.013866
host: access all arrays: 6.175977
host: access all arrays a second time: 0.570206
host: free: 0.382470
total: 11.658073

without using Unified Memory:(simpleMemcpy.cu)
./simpleMemcpy 400000000

host: MallocHost: 1.311571
host: init arrays: 3.348044
device: malloc+copy+compute: 1.734390
host: access all arrays: 2.175081
host: access all arrays a second time: 0.552091
host: free: 0.416059
total: 9.537628

AastaLLL · January 20, 2021, 2:35am

Hi,

First, please remember to maximize the device performance as below:

$ sudo nvpmodel -m 0
$ sudo jetson_clocks

Unified memory doesn’t require memory copy but does have some overhead in buffer synchronization.
Here is our document for Jetson memory for your reference:

Thanks.

Topic		Replies	Views
Why does it take longer for a program to use Unified Memory than not to use Uuified Memoery? Jetson AGX Xavier cuda	3	456	October 18, 2021
Bad performance when using unified memory CUDA Programming and Performance	2	3479	April 21, 2019
Unified Memory has poor performance on Jetson AGX Xavier Jetson AGX Xavier cuda	6	1288	February 9, 2022
Is Unified Memory in Tegra always fast? CUDA Programming and Performance cuda	0	494	December 17, 2020
Zero Copy Memory vs Unified memory CUDA processing Jetson TX1	28	21059	October 18, 2021
Kernel lunch overhead increases significantly (10x) when using unified memory on TK1 and TX1 Jetson TK1	5	3445	August 31, 2018
Unified Memory Access Performance of Arrays of Structures Problem on Jetson TX2 Jetson TX2 cuda	5	710	October 18, 2021
Accessing Unified Memory from ARM is very slow Jetson AGX Xavier cuda	2	560	October 18, 2021
Using CUDA Unified memory on embedded board (psychical unified memory) CUDA Programming and Performance	6	1587	July 14, 2016
Significant performance problem with Unified Memory based on driver version CUDA Programming and Performance	2	1467	July 31, 2018

Why does it take longer for a program to use Unified Memory than not to use Uuified Memoery?

Related topics