Both systems are Intel Xeon servers with identical configurations. Each has a CX5 in the same PCI slot, and both the servers and the KVM guests run OEL 7.6 with the UEK5 kernel. rping works between the two bare-metal servers. I then enabled SR-IOV on the CX5, created two VFs on the same CX5 port, and assigned one CX5 VF to each VM. After starting the VMs and setting up the CX5 inside them, the VMs can ping each other fine, but rping on the initiator side fails with a segfault:
No, I am not using the MLNX_OFED driver. I am using the drivers and utilities from the Oracle Linux 7 RDMA for UEK5 yum site. We don’t have MLNX_OFED for Linux 7 UEK5; as I understand it, the MLNX_OFED driver is supposed to be included in Linux 7 UEK5. I am using the same drivers and utilities on both the hosts and the VMs, but rping fails only in the VMs. I have gone through enabling SR-IOV, creating the VFs, and passing a VF through to each VM with PCI passthrough. I can see the CX5 with lspci in the KVM guests, for example:
Verified: after updating the firmware to version 16.28.2006 and setting the MAC address of the VF on the host with ip link set, rping runs successfully. The VF node_guid shown by ibv_devices on the host was still 0, but the KVM guest showed a non-zero value. This confirms that I hit known issue 1047616.
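The node GUID check described above could be scripted roughly as follows. This is only a minimal sketch: the helper name `zero_guid_devices`, the sample text, and the device names and GUIDs in it are hypothetical, and it assumes the usual two-column `ibv_devices` layout (device name, then a 16-digit hex node GUID).

```python
from typing import List


def zero_guid_devices(ibv_devices_output: str) -> List[str]:
    """Return the names of devices whose node GUID is all zeros.

    Expects text in the two-column form printed by `ibv_devices`.
    Header and separator lines are skipped because they do not
    parse as "<name> <hex GUID>".
    """
    zero = []
    for line in ibv_devices_output.splitlines():
        parts = line.split()
        if len(parts) != 2:
            continue  # header line ("device node GUID") or blank line
        name, guid = parts
        try:
            value = int(guid, 16)
        except ValueError:
            continue  # separator line made of dashes
        if value == 0:
            zero.append(name)
    return zero


# Hypothetical sample output; names and GUIDs are made up for illustration.
SAMPLE = """\
    device                 node GUID
    ------              ----------------
    mlx5_0              506b4b0300aabbcc
    mlx5_1              0000000000000000
"""

if __name__ == "__main__":
    print(zero_guid_devices(SAMPLE))  # -> ['mlx5_1']
```

In the case above, the check that mattered was the one inside the KVM guest, where the GUID became non-zero after the fix; on the host it still showed 0.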