I was able to run this P2P example successfully between two NVIDIA GPUs: cuda-samples/Samples/0_Introduction/simpleP2P at master · NVIDIA/cuda-samples · GitHub
Now I have a Xilinx FPGA connected to the same machine, and I want to know whether the CUDA runtime APIs can be used to perform a P2P memory copy between the FPGA and the GPU.
Specifically, I have an NVIDIA GTX 1080 Ti GPU and a Xilinx U280 FPGA, both of which are P2P capable: https://xilinx.github.io/Vitis_Accel_Examples/2020.2/html/p2p_fpga2fpga.html
However, it seems that the CUDA runtime APIs can only enable P2P access between two NVIDIA GPUs, as in these calls from the sample:
checkCudaErrors(cudaSetDevice(gpuid[0]));
checkCudaErrors(cudaDeviceEnablePeerAccess(gpuid[1], 0));
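For context, here is a minimal sketch of the GPU-to-GPU sequence I am referring to, reduced from what simpleP2P does. The names `CHECK`, `dev0`, `dev1`, `buf0` and `buf1` are my own placeholders, and the device ordinals 0 and 1 are an assumption about the machine. Every peer call takes integer CUDA device ordinals, and `cudaGetDeviceCount` only enumerates CUDA devices, so I do not see where an FPGA could be passed in.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: print the CUDA error and exit main() on failure.
#define CHECK(call)                                              \
  do {                                                           \
    cudaError_t err_ = (call);                                   \
    if (err_ != cudaSuccess) {                                   \
      std::fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,    \
                   cudaGetErrorString(err_));                    \
      return 1;                                                  \
    }                                                            \
  } while (0)

int main() {
  int count = 0;
  CHECK(cudaGetDeviceCount(&count));  // enumerates CUDA devices only
  if (count < 2) {
    std::printf("Need at least two CUDA GPUs, found %d\n", count);
    return 0;
  }

  const int dev0 = 0, dev1 = 1;  // assumed ordinals of the two GPUs

  // Ask the runtime whether each GPU can map the other's memory.
  int can01 = 0, can10 = 0;
  CHECK(cudaDeviceCanAccessPeer(&can01, dev0, dev1));
  CHECK(cudaDeviceCanAccessPeer(&can10, dev1, dev0));
  if (!can01 || !can10) {
    std::printf("Peer access is not supported between GPU %d and GPU %d\n",
                dev0, dev1);
    return 0;
  }

  // Peer access is enabled from the current device towards the peer,
  // so it has to be done once in each direction.
  CHECK(cudaSetDevice(dev0));
  CHECK(cudaDeviceEnablePeerAccess(dev1, 0));
  CHECK(cudaSetDevice(dev1));
  CHECK(cudaDeviceEnablePeerAccess(dev0, 0));

  // Allocate one buffer on each GPU.
  const size_t bytes = 1 << 20;
  void *buf0 = nullptr, *buf1 = nullptr;
  CHECK(cudaSetDevice(dev0));
  CHECK(cudaMalloc(&buf0, bytes));
  CHECK(cudaMemset(buf0, 0, bytes));
  CHECK(cudaSetDevice(dev1));
  CHECK(cudaMalloc(&buf1, bytes));

  // Copy from GPU 0 to GPU 1; both endpoints are named by CUDA ordinal.
  CHECK(cudaMemcpyPeer(buf1, dev1, buf0, dev0, bytes));
  CHECK(cudaDeviceSynchronize());
  std::printf("Copied %zu bytes from GPU %d to GPU %d\n", bytes, dev0, dev1);

  // Release the buffers on the devices that own them.
  CHECK(cudaFree(buf1));
  CHECK(cudaSetDevice(dev0));
  CHECK(cudaFree(buf0));
  return 0;
}
```

This should build with `nvcc` on a machine with two peer-capable GPUs. I have not found an equivalent of `dev1` that would refer to the U280.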