GPU <--> Xilinx FPGA failing, "Unsupported Request"

ryan.walton · October 31, 2024, 8:59pm

Hi,

I’m working on developing peer-to-peer communication between a Xilinx FPGA and a Turing GPU. x86_64 Intel host, with Broadcom (PLX) PCIe switches, RHEL9, NVIDIA driver 550.120.

I am encountering issues when trying to perform read and writes mastered by the FPGA to the GPU. I have followed the GPU Direct documentation online, and I’ve verified that I’m properly pinning Cuda allocated GPU memory into the GPU BAR.

I can do the following:

Master read and writes from the FPGA to the x86 host
Master read and writes from the GPU to the FPGA BAR
Master read and writes from the x86 host to the GPU, by mapping the section of the GPU BAR that holds the Cuda allocated buffer into the x86 host’s virtual memory.

However, when I try to master transactions from the FPGA to the GPU, I run into issues.

For writes mastered by the FPGA to the GPU, the GPU memory remains untouched.
For reads mastered by the FPGA to the GPU, I get “Unsupported Request” failure codes in the TLP completion packets.

I have data link layer data recordings captured by the FPGA for when this issue is occurring that I can post if it helps solve the issue.

I have made sure to disable the Intel “Access Control Service” in BIOS. The GPU is connected by a Broadcom (PLX) PCIe switch that should have and MMU functionality disabled.

What could be going on here?

Thanks
Ryan

Topic		Replies	Views
Reading GPU Memory fails for GPUDirect RDMA driver CUDA Programming and Performance	0	169	January 31, 2024
FPGA cannot communicate with A100 through XDMA Using RDMA RDMA Software For GPU rdmaroce-solutions	5	261	June 12, 2024
Error when trying to write data to GPU DMA memory (using GPU Direct RDMA) Jetson AGX Xavier pcie , kernel , fpga	8	1405	May 30, 2023
Peer-to-peer communication between GPU and FPGA CUDA Programming and Performance	2	1951	April 18, 2022
GPU direct access to DMA memory over PCIe Jetson Xavier NX pcie , cuda	4	2182	April 22, 2022
GPUDirect RDMA:FPGA cannot communicate with A100 through XDMA GPU - Hardware rdma-and-roce	0	163	May 29, 2024
P2P communication between GPU and FPGA CUDA Programming and Performance cuda	3	1885	December 6, 2022
Pinning GPU memory for RDMA failed CUDA Programming and Performance	1	550	April 3, 2022
Can't access 3rdParty PCIe device after cudaHostRegister() CUDA Programming and Performance	4	543	October 27, 2022
DMA Transfer between Third party device, host, and GPU CUDA Programming and Performance hw , cuda	0	660	June 8, 2020

GPU <--> Xilinx FPGA failing, "Unsupported Request"

Related topics