Unified memory

trinayan · May 16, 2017, 7:18pm

Since the Pascal architecture supports memory coherence can we say for sure that the TX2 also supports full memory coherency between the ARM CPU and Pascal GPU?

snarky · May 17, 2017, 2:17am

I think that depends on what you mean by “coherence.” ARM, itself is not as coherent as you’d be used to from the Intel/x86 world. The separate GPU is likely to add additional necesary synchronization points. It seems to me like it wouldn’t be possible to have a high performance GPU share an automatically coherent memory bus/ring with a multi-core general CPU. So: “signs point to no, but I’ve been wrong before!”

The question is: What do you want to do with this? All the APIs that allow you access to the GPU resources take care of coherency for you.

trinayan · May 18, 2017, 3:14pm

Hi,

Well thanks. But what I meant was with pascal nvidia supported memory coherence. For some reason I notice that cuda malloc managed fails on jetson tx2. Any reasons why this happens?

Best,
Trinayan

snarky · May 18, 2017, 6:31pm

No idea, sorry. Sounds like a CUDA bug, and/or some limitation in how much you can get/use.
I see you have another post explicitly about the CUDA malloc question, so let’s hope you get better answers there!

AastaLLL · May 19, 2017, 4:23am

Hi,

Thanks for your question.

We test standard um sample in tx2 and it works properly.
Could you share the source hit error in unified memory?

Please notes that you need to compile it with specified SM architecture.
Or you will got error when compiling sm_20 archi but it won’t be used on tx2.

/usr/local/cuda/bin/nvcc -gencode arch=compute_62,code=compute_62 test.cu -o test

Topic		Replies	Views
Unified Memory Access using Jetson TX2 Jetson TX2	5	2442	October 18, 2021
Programming strategy for pascal/TX2 memory hierarchy CUDA Programming and Performance	0	380	July 31, 2018
TX2 Memory Linux	0	564	July 20, 2018
TX2 GPU and CPU shared same memory space Jetson TX2	2	1455	October 18, 2021
On demand paging Jetson TX2	5	1534	October 18, 2021
CPU operation is very slow on memory allocated by cudaMallocHost Jetson TX2	13	1950	October 18, 2021
The memory sharing between cpu and gpu in Jetson TX2 Jetson TX2	6	7436	October 18, 2021
size of shrare memory between CPU and GPU Jetson TX2	7	1460	October 18, 2021
Unified Memory On TX1 Jetson TX1	4	941	October 18, 2021
Is PX2 support Unified Memory? DRIVE - Linux	1	910	October 9, 2017

Unified memory

Related topics