We have a setup on one workstation with two virtual machines. Each VM has its own GPU, and together they form a processing/visualization pipeline: results processed in the Linux VM are visualized in the Windows VM.
Communication between the two VMs goes through the hypervisor (virtual network). This means the data is copied from the first GPU into the Linux VM's memory, then across to the Windows VM, and finally onto the second GPU. This is CPU intensive, and we would like to explore GPUDirect for it.
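For context, the current path looks roughly like the sketch below (the function and buffer names and the socket plumbing are placeholders, not our actual code): the Linux side copies results from device to host memory and pushes them over the virtual network, and the Windows side does the reverse.

```cpp
// Sketch of the current CPU-staged path (names are placeholders).
// Linux VM: pull results off the GPU, then send them over the
// hypervisor's virtual network to the Windows VM.
#include <cuda_runtime.h>
#include <sys/socket.h>
#include <vector>

void send_frame(const float* d_result, size_t bytes, int sock_fd) {
    // 1) GPU -> host copy (consumes CPU and PCIe bandwidth in the Linux VM)
    std::vector<char> h_staging(bytes);
    cudaMemcpy(h_staging.data(), d_result, bytes, cudaMemcpyDeviceToHost);

    // 2) Host -> virtual NIC -> hypervisor -> Windows VM (more CPU copies)
    send(sock_fd, h_staging.data(), bytes, 0);

    // The Windows side then does recv() plus cudaMemcpyHostToDevice,
    // so the data crosses CPU memory twice before reaching the second GPU.
}
```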
The question is:
Would GPUDirect work in this PCIe passthrough setup? Something like having only the network interfaces and a cable doing a loopback to the same machine?
With this we want to offload some of the load from the CPU/hypervisor.
I am curious why you need two systems for one task. Is it possible to migrate the Linux application to Windows, since CUDA and TensorRT are supported on Windows?
Well, the hardware itself can be upgraded. We are currently using an RTX 4000 in Linux and a GTX 1070 in Windows. But we are more interested in the general question of whether a GPUDirect link can be established between two GPUs in the same system, to avoid overloading the CPU with copying the data.
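For what it's worth, when both GPUs are visible to the same OS instance (bare metal, or both passed through to a single VM), CUDA exposes peer-to-peer copies that go over PCIe (or NVLink, where available) without staging through host memory. A minimal sketch, assuming devices 0 and 1 are the two GPUs; whether the P2P check passes depends on the specific GPUs and topology, and this does not by itself span two separate VMs:

```cpp
// Minimal CUDA peer-to-peer sketch: only applies when both GPUs are
// visible to the same OS instance, not across two separate VMs.
#include <cuda_runtime.h>
#include <cstdio>

int main() {
    int canAccess01 = 0, canAccess10 = 0;
    cudaDeviceCanAccessPeer(&canAccess01, 0, 1);  // can GPU 0 access GPU 1?
    cudaDeviceCanAccessPeer(&canAccess10, 1, 0);  // and the reverse?
    if (!canAccess01 || !canAccess10) {
        std::printf("P2P not supported between these two GPUs\n");
        return 1;
    }

    size_t bytes = 64 << 20;  // 64 MiB test buffer
    void *d_src = nullptr, *d_dst = nullptr;

    cudaSetDevice(0);
    cudaDeviceEnablePeerAccess(1, 0);   // allow GPU 0 to access GPU 1
    cudaMalloc(&d_src, bytes);

    cudaSetDevice(1);
    cudaDeviceEnablePeerAccess(0, 0);   // allow GPU 1 to access GPU 0
    cudaMalloc(&d_dst, bytes);

    // Direct GPU 0 -> GPU 1 copy over PCIe/NVLink, no host staging buffer.
    cudaMemcpyPeer(d_dst, 1, d_src, 0, bytes);
    cudaDeviceSynchronize();

    cudaFree(d_dst);
    cudaSetDevice(0);
    cudaFree(d_src);
    return 0;
}
```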
To confirm: if I am running two VMs, both running Linux, each with its own independent GPU assigned via PCIe passthrough, should I be able to initiate GPUDirect RDMA transfers over PCIe without hitting shared CPU memory?
As a follow-up: would this work with two Windows VMs? And would GPUDirect between the two VMs over a physical NVLink be supported on either Linux or Windows, to accelerate transfers?
I’ve wondered about this for a long time, hopefully you’ve run across some use cases like this.