We are porting parts of the Mellanox OFED stack using the Ubuntu 18.04 version 5.1-2.5.8.0 release to another OS using the ConnectX-5 ethernet NIC. We have the following question: How does the NIC firmware direct a RoCE v2 packet, received via the wire interface, to the applicable destination QP? What are the conditions that could cause a RoCE v2 packet to be silently dropped so that it will not show up at the expected/programmed QP?
We have ported ud_pingpoing to our target OS and are able to send RoCE packets to a Linux host configured with OFED 5.1-2.5.8.0. The Linux host receives that packet and responds with a RoCE packet. We are able to see that this packet is received on the target ConnectX-5 NIC from the rx counters. However, the programmed QP and CQ are not detecting this RoCE packet. We suspect that it is being dropped.
The firmware version on our ConnectX-5 cards is: 16.28.2006 and the board_id is MT_0000000012.