PCIe DMA broadcast & CUDA

pstach · August 18, 2008, 8:41pm

Does CUDA currently support broadcast DMA transfers? I’m experiencing some bottlenecks in transfers from host memory to multiple cards, and broadcast seems to be an option with some MCHs, such as the one found in the 790i Ultra SLI. If not, is there a supported way to do GPU-to-GPU transfers over the PCIe bus? I’ve seen this done in OpenGL shader demos before.

pstach · August 19, 2008, 3:19am

Also, is there a way to share cudaMallocHost regions among devices? I realize that the CUDA runtime makes calls to the driver to set up the region, but couldn’t the runtime expose a way to reuse these regions?

Simon_Green · August 19, 2008, 11:18am

The answer to both questions is: not currently, but we’re working on it.

See this thread:
http://forums.nvidia.com/index.php?showtopic=41710

Fast peer-to-peer GPU transfers are one of my most desired CUDA features, so I’m on your side!