I’m not sure whether the AGX kernel 4.9.140-tegra supports OFED GPUDirect RDMA, but I followed this guide anyway: Mellanox OFED GPUDirect RDMA
I was able to install the MLNX_OFED package as described in the user manual. I then downloaded the GPUDirect RDMA package nvidia-peer-memory_1.1.tar.gz, but when I tried to build nv_peer_mem I got the following error:
DKMS make.log for nvidia-peer-memory-1.1 for kernel 4.9.140-tegra (aarch64)
Tue Apr 20 22:12:02 EDT 2021
INFO: Building with MLNX_OFED from: /usr/src/ofa_kernel/default
-E- Cannot locate nvidia modules!
CUDA driver must be installed before installing this package!
Makefile:91: recipe for target 'gen_nv_symvers' failed
make: *** [gen_nv_symvers] Error 1
I have verified that the CUDA driver and devel packages are all installed, so I’m not sure what I am missing.
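
For anyone trying to reproduce this, here is a minimal sketch of the checks I would run to see which NVIDIA kernel modules the system actually exposes. It rests on assumptions: the module names `nvidia` and `nvgpu` and the `/usr/src/nvidia*` location are guesses based on the error text, not taken from the nv_peer_mem Makefile, and the function name `report_nvidia_modules` is only illustrative. A driver that is built into the kernel rather than loaded as a module would not show up in the loaded-module list.

```shell
#!/bin/sh
# Diagnostic sketch: print what NVIDIA-related kernel modules are visible.
# Assumption: "nvidia" and "nvgpu" are the module names worth checking;
# they are inferred from the error message, not from the package Makefile.
report_nvidia_modules() {
    echo "running kernel: $(uname -r)"

    # Is a module called "nvidia" known for the running kernel?
    if modinfo nvidia >/dev/null 2>&1; then
        echo "modinfo: nvidia module found"
    else
        echo "modinfo: no nvidia module found for this kernel"
    fi

    # Which NVIDIA-related modules are currently loaded?
    if [ -r /proc/modules ]; then
        grep -i -E '^(nvidia|nvgpu)' /proc/modules \
            || echo "loaded: no nvidia/nvgpu modules listed"
    fi

    # Are there NVIDIA driver sources under /usr/src? (assumed location)
    ls -d /usr/src/nvidia* 2>/dev/null \
        || echo "sources: no /usr/src/nvidia* directories"
}

report_nvidia_modules
```

If `modinfo` reports no `nvidia` module, that would at least be consistent with the "Cannot locate nvidia modules" message, even with the CUDA packages installed.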