GPUDirect RDMA PCIe Topology

jarush · October 20, 2021, 3:30pm

I am working on a hardware design with a CPU connected to a PCIe 3.0 switch over 4 PCIe lanes (x4) and an FPGA and GPU connected to the same PCIe switch over 16 lanes (x16). When performing a GPUDirect RDMA to transfer data between the FPGA and GPU, will the two devices use all 16 lanes, or will the CPU connected to the PCIe switch with only 4 lanes effect the transfer speed or number of lanes used between the FPGA and GPU?

rs277 · October 20, 2021, 6:54pm

Looking at the block diagram here:

If the hardware is GPUDirect compliant, the CPU should have no significant involvement.

nunduniel · October 23, 2021, 5:16pm

I have a similar design.
I have x8 from CPU to switch and x16 between GPU and FPGA on a 48lane PCIe switch.
I get full bandwidth for RDMA between FPGA and GPU.

system · November 6, 2021, 5:16pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
GPU Direct + PCIe topology CUDA Programming and Performance	3	241	June 27, 2024
Questions on GPUs for software-defined radios CUDA Programming and Performance	2	3022	February 23, 2016
GPUDirect RDMA Single PCI-e writes CUDA Programming and Performance	2	568	October 23, 2018
Using GPUDirect RDMA under OpenCL CUDA Programming and Performance	2	1510	August 7, 2024
Slow Memory Copies CUDA Programming and Performance	7	1166	November 6, 2018
FPGA - GPU, no host PC CUDA Programming and Performance	5	2024	March 3, 2018
GPUDirect RDMA with FPGA PCIe EP on Jetson Orin AGX CUDA Programming and Performance	0	26	April 14, 2025
GPU Communication Protocol CUDA Programming and Performance	16	6256	May 17, 2010
GPU2FPGA transfer rate is lower than FPGA2GPU when using GPUDirect RDMA CUDA Programming and Performance	6	1339	May 27, 2022
RDMA GPU Direct Slow CUDA Programming and Performance	10	2407	February 13, 2019

GPUDirect RDMA PCIe Topology

Related topics