Sharing PagedLockMemory between Processes

Hello,

Sorry for starting a new topic about a wish, but I was not able to reply to the wishlist at http://forums.nvidia.com/index.php?showtop…rt=#entry248850 because it was locked.
My problem is that I need to share host memory buffers between different processes on a Linux system. Currently I am using Linux System V shared memory segments for this. I need to copy these buffers to CUDA device memory. As I cannot register an existing buffer with the CUDA runtime for DMA transfers, and I cannot attach a buffer allocated with cudaMallocHost to a second process, I am currently out of luck and cannot use DMA transfers.
This thread also discusses a similar problem:
http://forums.nvidia.com/index.php?act=ST&…=71&t=41710
And something like this has already been mentioned on the wishlist here:
http://forums.nvidia.com/index.php?showtop…aded&start=

As these two posts are both quite old, I want to know if there is anything new about this issue. To solve my problem I see two possibilities:

  1. Extend the CUDA runtime with functions like shmget and shmat from Linux System V IPC, so that a buffer created by cudaMallocHost can be shared between processes.
  2. Create a buffer with shmget, page-lock that buffer with shmctl, and register this page-locked buffer with the CUDA runtime.

Please correct me if I am wrong or have overlooked something.

DMA transfers are important to me for two reasons:

  1. They are faster.
  2. They enable asynchronous launches with CUDA streams.
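For readers unfamiliar with point 2, the following is a rough sketch of the overlap that page-locked memory makes possible (it assumes a CUDA-capable system and the CUDA toolkit; the function name copy_async is just an illustration). cudaMemcpyAsync is only truly asynchronous with respect to the host when the host buffer is page-locked, which is why buffers from plain shmget cannot currently be used this way.

```c
/* Sketch: asynchronous host-to-device copy overlapping with CPU work.
 * Requires a CUDA-capable GPU; error checking omitted for brevity. */
#include <cuda_runtime.h>

void copy_async(float *dev, size_t n)
{
    float *host;
    cudaStream_t stream;

    /* cudaMallocHost returns page-locked memory; only such memory can
     * be copied with cudaMemcpyAsync while the CPU keeps working. */
    cudaMallocHost((void **)&host, n * sizeof(float));
    cudaStreamCreate(&stream);

    cudaMemcpyAsync(dev, host, n * sizeof(float),
                    cudaMemcpyHostToDevice, stream);
    /* ... CPU work here overlaps with the DMA transfer ... */
    cudaStreamSynchronize(stream);

    cudaStreamDestroy(stream);
    cudaFreeHost(host);
}
```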

Best regards

Jiri Kraus

I’d like to be able to do the same thing! I already asked for this a while ago in a wish-list topic. Some advantages I see:

    Ability to really separate I/O from the algorithm

    Easily switch the algorithm or I/O at runtime

    I’m sure there are more!

Same here.