Why do SimpleIPC example use sharedMemoryCreate before cudaMalloc?

Tumb1eweed · July 19, 2022, 8:05am

Recently I need to use CUDA IPC to transfer image data between ROS2 nodes, and I try to grasp CUDA IPC by reading simpleIPC example. But some code confuse me, why need sharedMemoryCreate before cudaMalloc? Doesn’t shm_open allocate memory in host memory?

Robert_Crovella · July 19, 2022, 2:07pm

The CUDA IPC mechanism allows for sharing of a device memory allocation from one process to another. The steps needed are approximately as follows:

Process A allocates device memory.
Process A gets a CUDA IPC handle for the allocation from step 1
Process A creates a host IPC instance, so that the handle from step 2 can be communicated to process B
Process A puts the handle into the host IPC mechanism
Process B picks up the handle from the host IPC mechanism
Process B uses the handle to “request access” to the underlying allocation

So the host IPC mechanism is needed for the communication of the device/CUDA IPC handle, from process A to process B.

system · August 2, 2022, 2:08pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Why exporting and importing CUDA IPC handles in the scope of the same Linux process is not supported? CUDA Programming and Performance cuda	7	665	May 10, 2023
cudaMalloc and sharing between CPU threads CUDA Programming and Performance	0	4342	May 20, 2009
How is cudaMalloc implemented? CUDA Programming and Performance	0	4702	July 16, 2010
How to share CUDA memory between two processes? CUDA Programming and Performance	3	2874	July 9, 2018
What will happen if I create interprocess memory by cudaIpcGetMemHandle while no other program process it? CUDA Programming and Performance cuda	4	39	August 15, 2024
Share GPU/host pinned memory between host processes CUDA Programming and Performance	5	4011	March 7, 2012
How to share the memory allocated by cudamallocasync during graph capture? CUDA Programming and Performance	2	182	June 1, 2024
CUDA 4.1 RC1: "Peer-to-peer communication between processes"? CUDA Programming and Performance	4	7988	November 9, 2011
Memeory allocation on Host Memory allocation to Host to Device Transfer CUDA Programming and Performance	2	1355	December 10, 2009
Using Shared Data resting in GPU across multiple programs CUDA Programming and Performance cuda	4	61	August 8, 2024

Why do SimpleIPC example use sharedMemoryCreate before cudaMalloc?

Related topics