Multiple GPU's and sharing memory Will a CUDA API eventually be provided for this?

DonM · May 19, 2009, 10:35pm

I’ve read various posts about this topic and understand that the programmer must explicitly handle multiple GPU’s independently. I also understand that it is possible to use cudaMemcpy between devices and even to speed this up by using pinned memory shared between host threads (as of toolkit 2.2).

However, what I would like to do is transfer directly from the GPU memory of one card (of say an S1070) to another, without having to go through host memory.

Is this already possible and I am not finding the API to do so?

Or, might it be possible in the future with CUDA?

Tangential question, if anyone knows, does OpenCL provide the means of using multiple devices transparently?

Thanks.

tmurray · May 19, 2009, 11:31pm

It’s not possible now, but it may be possible in the future, yes.

OpenCL does not support any sort of transparent multi-GPU acceleration.

ktashiro · June 25, 2010, 6:48pm

Hi. We’re investigating the capability of multiple GPU boards peer-to-peer communication. Or possibly data transfer to/from other FPGA PCIe board, without using the host memory.

Back in May, 2009, about a year ago, the answer was NO (according to the answer posted by Tmurray".

Is it still the same status, or the CUDA has added such new feature??

parallelis · June 28, 2010, 2:32pm

AFAIK but not having testing it myself (due to lack of test-bed with 2 GPU with Pinned Mapped Memory capability), it seems possible to exchange data between GPU using Pinned Mapped Memory, that is allocated for all GPU at-once.

The drawback is that data will be written in main memory, and read from it, but it’s bandwidth is way higher than PCI-e 16x bus, so I don’t see it as a problem in itself.

ktashiro · June 28, 2010, 3:42pm

Thank you for the reply, iAPAX. I was wondering peer-to-peer communication between 2 GPU boards, but this can be an alternative solution if 2 GPU boards can communicate through the host memory (via DMA). Thank you for the advice.

Sorry for the beginer’s question; if 2 boards are GPU boards, using CUDA API, we can allocate pinned mapped memory in host memory and 2 boards can share and DMA data through this memory (correct?). I assume it’s virtual memory space returned by CUDA that the GPU(s) actually can access to (correct?).

I’m also considering the possility of whether our custom FPGA PCe board can send data to the GPU board via DMA (want to avoid the host SW to do this for the performance reasons.) In the previous example of 2 GPU boards, I assume 2 GPU boards can share and read/write from/to the virtual memory space of the host pinned mapped memory, allocated by CUDA. Does CUDA have a way to return the physical memory or can we specify the physical memory’s base address, so the 3rd non-GPU board (ex. FPGA board) can share the same pinned mapped memory space?

Kenji

Topic		Replies	Views
copy in multi GPUs CUDA Programming and Performance	13	4932	February 7, 2009
how to use portable pinned memory for multiple gpu CUDA Programming and Performance	1	3088	September 7, 2009
Data copy between multi-GPUs CUDA Programming and Performance	2	1606	October 14, 2008
Host to multiple device transfers CUDA Programming and Performance	0	2317	January 20, 2012
Copy data from one GPU to another CUDA Programming and Performance	2	2212	July 1, 2010
Data transfer between two GPUs CUDA Programming and Performance	6	2851	September 9, 2009
Question about multi-GPU programming Memory accesses and sharing CUDA Programming and Performance	10	7280	January 13, 2009
PCI Transfer directly to GPU anytime soon? CUDA Programming and Performance	8	3803	November 30, 2010
How to communicate beetween two GPUs Tesla D870 : two tesla C870 GPUs CUDA Programming and Performance	2	1647	April 10, 2008
selfmade cudeMallocHost()? CUDA Programming and Performance	9	8729	February 14, 2008

Multiple GPU's and sharing memory Will a CUDA API eventually be provided for this?

Related topics