I have an NVIDIA A100 card connected to a server through PCIe gen 4. I want to transfer data directly from an FPGA card to the A100 for real-time data processing. If I transfer data from FPGA to CPU RAM (XDMA) and from RAM to GPU (cudaMemcpy) for processing, it would be very slow and does not meet my requirements. So I have the following questions:
- Is A100 support GPUdirect RDMA?
- Is it possible to transfer data directly from the FPGA to the A100 through GPUdirect RDMA?
- If RDMA is possible for A100, could you suggest to me some reference documents or anything related to RDMA so that I can follow and build my own design?
Thank you so much!!!