CUDA UVM MEMORY USAGE - IMPLEMENTATION DETAILS

Hi there!

I have a question on UVM and its implementation details.

I think that UVM can make use of not only the GPU's GDDR5 (global) memory, but also other, faster memory types on the GPU (shared memory, L1/L2 cache, registers). On the other hand, some researchers claim that UVM uses only the GPU's GDDR5 memory (global memory, as I understand it). However, looking through the documentation provided by NVIDIA, I have not been able to confirm either claim.

Is there any document describing the inner mechanisms of the UVM implementation, or more generally, which memory types it uses and under which circumstances? And what about the optimizations made by the compiler/“UVM manager”?

I would really appreciate any direction on these subjects.

Thanks in advance,

I don’t have any implementation details. But UM (“Unified Memory”)

Programming Guide :: CUDA Toolkit Documentation

applies to global memory only; not local, shared, constant, texture, or any other type. If you study the above programming guide section, this will be fairly evident, and in some cases explicit (e.g. with respect to constant memory).
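A minimal sketch may make the point above concrete (my own illustrative example, not from the programming guide, assuming a UM-capable GPU and toolkit): `cudaMallocManaged` returns a single pointer that is valid on both host and device, but the allocation itself lives in global (device DRAM) memory:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// The kernel sees an ordinary global-memory pointer; nothing about the
// allocation being "managed" is visible here.
__global__ void addOne(int *x) { *x += 1; }

int main() {
    int *x;
    // Unified Memory: one pointer usable from host and device code,
    // backed by global memory and migrated on demand by the UM system.
    cudaMallocManaged(&x, sizeof(int));
    *x = 41;                      // written on the host
    addOne<<<1, 1>>>(x);          // read and written on the device
    cudaDeviceSynchronize();      // required before the host touches managed data again
    printf("%d\n", *x);          
    cudaFree(x);
    return 0;
}
```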

Thanks for the post!

It seems I overlooked the part referring to constant declarations. How embarrassing! :-0

There are, however, some thoughts/questions about that:

  • It would be great if NVIDIA stated this explicitly in their documentation.
  • If UVM is meant to lower the entry barrier to GPU programming, is the compiler making any further memory optimizations, or are shared memory and the other faster memory types simply neglected when managed variables are used? It does not seem very smart to leave those faster memory types only to experienced programmers using the older, explicit memory declarations.
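To illustrate the second bullet with a sketch of my own (an assumption about current toolkits, not an NVIDIA statement): even when a buffer is allocated with `cudaMallocManaged`, staging it into fast on-chip shared memory is still an explicit, programmer-written step inside the kernel; Unified Memory does not promote the data automatically:

```cuda
#include <cuda_runtime.h>

#define TILE 256

// Managed data arrives as a plain global-memory pointer; the copy into
// shared memory below is written by hand, exactly as with cudaMalloc'd data.
__global__ void reverseTile(float *data) {
    __shared__ float tile[TILE];        // explicit shared memory, unrelated to UM
    int i = threadIdx.x;
    tile[i] = data[i];                  // global (managed) -> shared
    __syncthreads();
    data[i] = tile[TILE - 1 - i];       // shared -> global (managed)
}

int main() {
    float *data;
    cudaMallocManaged(&data, TILE * sizeof(float));  // lives in global memory
    for (int i = 0; i < TILE; ++i) data[i] = (float)i;
    reverseTile<<<1, TILE>>>(data);
    cudaDeviceSynchronize();
    cudaFree(data);
    return 0;
}
```

Whether the compiler ever caches managed data in registers or L1/L2 is, as far as I can tell, an unspecified hardware/compiler detail rather than part of the UM programming model.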

Any hint on the previous questions or directions on where to ask for further information about UVM implementation details would be highly appreciated.

Thanks again!