Sharing CUDA memory between processes

As stated in the CUDA docs, the CUDA IPC functions do not work on Tegra platforms, which of course includes Jetson.

Is there any other way to create a CUDA memory buffer that is shared between two separate processes?

Hi,

Could you share more detail about your use case?

Do you want to share the GPU buffer while both processes are still alive?
Or is it possible that one of the processes will terminate before the other accesses the buffer?

Thanks.

Yes, both processes stay alive. The use case is a producer/consumer pattern: the first process fills the shared CUDA buffer and signals the other process that the buffer is ready, and then the second process reads it.

Essentially it is a zero-copy problem: the two processes need to exchange large amounts of data, and copying it over conventional IPC mechanisms is rather expensive.

I am completely ignorant of Tegra/Jetson.
Is this an SoC architecture?
For SoC architectures, Vulkan has a special memory category (host-visible coherent memory, I think).
This is memory that is directly addressable by both CPU and GPU.
If so, is Tegra/Jetson RAM shared via a local address/data bus?
Can applications mmap this type of memory and share it between apps as CUDA device memory, eliminating the need to use cudaIpcXyz()?
It may be possible to address this memory via a Vulkan binding but only use it in your kernels.

I would also like to know the right way to share NvBuffers between multiple processes without copying.

For example, one producer (a decoder process) and multiple other processes that process the decoded buffers, possibly running in different containers.

Hi,

Jetson is a platform with an integrated GPU, unlike a desktop GPU that connects to the host over PCIe.
For the memory question, you can find some information below:

https://docs.nvidia.com/cuda/cuda-for-tegra-appnote/index.html#not-supported-on-tegra

You can try to use EGLStream or NvSci to communicate between CUDA contexts in two processes.

Thanks.

As I understand it, the EGL* functions require a running desktop environment, which is not our case.

Thank you for your suggestion. I’m also trying to use shared GPU memory across several processes on a Jetson Xavier, and it seems that NvSci is the alternative to go with. However, I couldn’t find the library. It seems that, at the time of writing, NvSci ships exclusively with the DRIVE package, which requires a membership for autonomous-driving development. Am I mistaken?

Thanks