Unified Memory Access using Jetson TX2

wonshik.kim · April 6, 2018, 2:49am

Hi, I’m testing cuda program using Jetson TX2

But there is problem when I try to use ‘unified memory access’
(It occurs segmentation fault)

So, I query the property of Jetson TX2’s gpu, and I found that It doesn’t support ‘concurrentManagedAccess’ even though it is Pascal architecture.

How can I use unified memory access with ‘concurrentManagedAccess’ property

I’m using Jetpack 3.2 and cuda 9.0

AastaLLL · April 9, 2018, 8:18am

Hi,

For TX2, unified memory requires exclusive access or will lead to a segmentation fault.
Please avoid concurrent CPU/GPU accesses with cudaDeviceSynchronize().

You can find more information in our document here:
http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#um-gpu-exclusive

Thanks.

PeartreeStudios · August 11, 2018, 1:47pm

Hello,
I’m also having this issue (Although with a Jetson TX1 rather than TX2).

The documentation states that cudaStreamSynchronize can also be used for synchronisation.

I still get the bus error using cudaStreamSynchronize, but I don’t get it using cudaDeviceSynchronize.

Unfortunately cudaDeviceSynchronize is not suitable for my application as I wish to update the CUDA buffer from a CPU thread while the GPU works on other streams.

(Note: I’m using the Driver API so the calls for me are actually cuCtxSynchronize and cuStreamSynchronize.)

Are there any other gotchas to be aware of?

Thanks
Russell

PeartreeStudios · August 12, 2018, 4:58pm

Just to update, I’ve managed to get it working with cuStreamSynchronize now.
I think it was just my call to cuMemAllocManaged needed the flag CU_MEM_ATTACH_HOST to be accessible from CPU.

dumbogeorge · November 6, 2018, 9:42am

Hi PeartreeStudios,

Could you please clarify whether you are able to access cuMemAllocaManaged buffer when GPU is still active ?

I am looking to understand meaning of what you said, vs what document says. For example

https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__MEM.html#group__CUDA__MEM_1gb347ded34dc326af404aa02af5388a32

says -

"If CU_MEM_ATTACH_HOST is specified, then the allocation should not be accessed from devices that have a zero value for the device attribute CU_DEVICE_ATTRIBUTE_CONCURRENT_MANAGED_ACCESS; an explicit call to cuStreamAttachMemAsync will be required to enable access on such devices. "

Did you also do a cuStreamAttachMemAsync() to enable concurrent access ?

Thanks

Topic		Replies	Views
Unified Memory On TX1 Jetson TX1	4	855	October 18, 2021
Unified memory and concurrent C++ objects Jetson TX2	10	2502	October 18, 2021
Unified memory concurrent access Jetson AGX Xavier	4	1938	November 19, 2018
Segmentation fault or bug error when use unified memory on jetson nx Jetson Nano cuda	4	465	April 12, 2023
CUDA Unified Memory concurrent access Jetson AGX Xavier cuda	4	1155	October 18, 2021
Usage of Unified Memory on R28.1 vs R24.2.1 Jetson TX2	2	520	October 18, 2021
Cuda memory access with cudaMallocManaged CUDA Programming and Performance camera , cuda	3	84	September 11, 2024
Invalid Managed Memory Access Jetson TX2	2	1214	October 18, 2021
What exactly does the managed memory flag do and what changes? CUDA Programming and Performance	5	1029	January 12, 2022
CPU operation is very slow on memory allocated by cudaMallocHost Jetson TX2	13	1712	October 18, 2021

Unified Memory Access using Jetson TX2

Related topics