What exactly does the managed memory flag do and what changes?

dkqhzm2 · January 12, 2022, 2:27pm

Yes.
I’ve tried using cudaStreamSynchronize() or cudaDeviceSynchronize() after the kernel call and it works fine.

But synchronizing after the kernel call is not what I want, because we want to read the next parameter into the same memory while the kernel is running on the GPU. As you can see from the code, it is designed never to access the same memory address.

Also, what you quoted says that Jetson managed memory cannot be accessed concurrently. In that case, why would you enable concurrent access using the cudaMemAttachHost flag? Shouldn’t this flag be used?
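
For reference, here is a minimal sketch of the pattern the question is about, as I understand it from the CUDA runtime documentation. It is not the code from this thread. The kernel `scale`, the buffers `current` and `next`, and the function `run_pipeline` are hypothetical names, and using two separate managed allocations is an assumption. The idea is that a buffer allocated with cudaMemAttachHost stays CPU-owned until it is attached to a stream with cudaStreamAttachMemAsync(), so the CPU can keep writing to a host-attached buffer while a kernel works on a different, stream-attached one, even when cudaDevAttrConcurrentManagedAccess is 0.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: scales every element in place.
__global__ void scale(float* data, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

// Returns the number of wrong results, or -1 on a CUDA error.
int run_pipeline() {
    const int n = 1 << 16;
    const size_t bytes = n * sizeof(float);

    // 0 means the CPU must not touch GPU-visible managed memory
    // while a kernel is running (reported as the case on Jetson).
    int concurrent = 0;
    if (cudaDeviceGetAttribute(&concurrent,
                               cudaDevAttrConcurrentManagedAccess,
                               0) != cudaSuccess) return -1;
    printf("concurrentManagedAccess = %d\n", concurrent);

    cudaStream_t stream;
    if (cudaStreamCreate(&stream) != cudaSuccess) return -1;

    // Both buffers start out host-attached (CPU-owned).
    float* current = nullptr;
    float* next = nullptr;
    if (cudaMallocManaged(&current, bytes, cudaMemAttachHost) != cudaSuccess) return -1;
    if (cudaMallocManaged(&next, bytes, cudaMemAttachHost) != cudaSuccess) return -1;

    for (int i = 0; i < n; ++i) current[i] = 1.0f;

    // Hand `current` to this stream only; `next` stays host-attached.
    // Length 0 means the whole allocation.
    if (cudaStreamAttachMemAsync(stream, current, 0,
                                 cudaMemAttachSingle) != cudaSuccess) return -1;
    scale<<<(n + 255) / 256, 256, 0, stream>>>(current, n, 2.0f);

    // The kernel may still be running here. The CPU only touches
    // `next`, which no kernel can see.
    for (int i = 0; i < n; ++i) next[i] = 3.0f;

    // After the stream is idle the CPU may read `current` again.
    if (cudaStreamSynchronize(stream) != cudaSuccess) return -1;

    int errors = 0;
    for (int i = 0; i < n; ++i) {
        if (current[i] != 2.0f) ++errors;
        if (next[i] != 3.0f) ++errors;
    }

    cudaFree(current);
    cudaFree(next);
    cudaStreamDestroy(stream);
    return errors;
}
```

Calling `run_pipeline()` from `main` should return 0 if this ownership hand-off behaves as described. To process `next` on the GPU afterwards, it would be attached to the stream in the same way before the next kernel launch.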
