cudaMallocManaged on jetson devices

NikolayChernuha · March 5, 2023, 9:42pm

Hello,

I want to understand better the difference between Unified Memory and Pinned Memory on jetson devices.
As I read here: Memory Management, both will be accessible on CPU and iGPU, but what is the best practice for large buffers? I plan to access those buffers from CPU, NPPI and CUDA kernels.

Thanks,
Nikolay

njuffa · March 5, 2023, 10:52pm

Question regarding Jetson platforms typically received better / faster / more numerous responses in the sub-forums dedicated to them:

Robert_Crovella · March 6, 2023, 1:10am

Before posting on a jetson forum:

If you study table 1 as well as section 4.1 you may get some insight from that.

Anyone trying to help you would likely immediately want to know concepts covered there, such as what is the compute capability of your device, and do you need coherent (for the sake of this discussion, lets say “simultaneous”) access between CPU and GPU.

You likely would be able to create a more focused posting on the Jetson forum of your choice, by giving some thought to what is presented in the document you linked.

If it were me, to a first order approximation, and with no additional information, I would say that the “cached” characteristic(s) of managed memory on Jetson (as indicated in table 1) vs. the “uncached” characteristic of pinned memory on Jetson, would cause me to immediately prefer managed memory for general usage.

If you are unfamiliar with what a cache is and why it might be interesting, that concept is not unique or specific to CUDA or Jetson, a google search will enlighten.

NikolayChernuha · March 6, 2023, 2:01pm

Thanks for the information!

Topic		Replies	Views
Optimising GPU and CPU memory transfer time (CUDA/Hardware)? CUDA Programming and Performance hw , cuda	8	4042	January 7, 2022
Zero-Copy and Managed memory on Jetson Jetson TX1	9	11604	August 20, 2018
Different types of memory transfer change the execution time of kernel on Tegra x1 Jetson TX1	5	860	October 18, 2021
Performance issues after refactoring CUDA code to avoid managed memory CUDA Programming and Performance jetson	5	57	November 19, 2024
CPU operation is very slow on memory allocated by cudaMallocHost Jetson TX2	13	1722	October 18, 2021
Using CUDA Unified memory on embedded board (psychical unified memory) CUDA Programming and Performance	6	1491	July 14, 2016
Unified Memory on Jetson Platforms Jetson Xavier NX cuda	4	4497	October 18, 2021
RE: Performance issues after refactoring CUDA code to avoid managed memory Jetson AGX Xavier cuda	4	36	November 25, 2024
Question about cudaManagedMemory and zero-copy memory for Jetson AGX Jetson AGX Orin cuda	2	38	November 18, 2024
Asynchronous memory transfer on Jetson TX1 Jetson TX1	10	1618	October 18, 2021

cudaMallocManaged on jetson devices

Related topics