As the title says, I execute
sudo iblinkinfo on the host-side of server A and it reports the error:
ibwarn:  _do_madrpc: send failed; Invalid argument
ibwarn:  mad_rpc: _do_madrpc failed; dport (DR path slid 0; dlid 0; 0)
/var/tmp/rdma-core/rdma-core-54mlnx1/libibnetdisc/ibnetdisc.c:811; Failed to resolve self
However, if I execute the same command on the dpu(Bluefield 2) side of server A, it succeeds.
In fact, when I send RDMA requests from another server B to the host side and dpu side of server A, they fail on the host side of A and succeed on the dpu side of A.
Could someone please provide me some ideas to solve the problem, thanks a lot.
- The link state are normal as
- The ofed version of host and dpu side are both MLNX_OFED_LINUX-5.4-188.8.131.52
- Both host and dpu side are equipped with Ubuntu OS