Intermittent timeout when modifying QP to RTR with RoCE

ctang1207 · June 26, 2025, 6:24pm

I am using CX6 cards (4123).
I create 4 QPs, wiring to 4 different remote QPs, the first two QP were successful. The third one timeout (errno=110).
At first I think there is a network issue that the NIC can’t resolve the remote IP’s MAC. so I print the MAC entry before and after the ibv_modify_qp(RTR) call.
before, there is no MAC entry, but right after the call, the MAC entry exists. However, ibv_modify_qp(RTR) already failed.
This is intermittent and hard to reproduce.
In the Mellanox driver, what is the reason to generate errno=110 with ibv_modify_qp(RTR) ?

Thanks for any suggestion for solve this problem.

By the way, two years ago, there was a similar question:

But it was closed and I did not get the answer from the dialog.

Topic		Replies	Views
Intermittent Connection timed out while setting RC QP to RTR InfiniBand/VPI Adapter Cards	3	1624	August 31, 2023
How to change ack timeout value in CX5 InfiniBand/VPI Adapter Cards	1	111	October 16, 2024
In the "Raw Ethernet Programming: Basic Introduction - Code Example" post, when trying to run the receiver code, it fails when trying to create the qp. "I get Couldn't create RSS QP" and the code exits. Any thoughts on what might be wrong? Thanks. Software And Drivers ethernet , qp , port , programming	1	964	March 4, 2020
How can I obtain the real rate_limit of qp Mellanox OFED	4	1338	November 17, 2022
CX3 SR-IOV: TX timeout on queue: 4 InfiniBand/VPI Adapter Cards	2	1244	May 21, 2020
While disabling the Virtual kernel mode driver running with heavy traffic, API modify_qp() is not responding properly. It gets hanged	0	160	March 9, 2015
Random packet loss when using raw packet QPs and L2 flows Mellanox OFED	0	277	December 7, 2017
I have problem porting my RDMA application from InfiniBand(Mellanox Connectx-3 40Gb IB) to RoCE(Connectx-4 100GbE). Mellanox OFED	2	723	April 3, 2016
How do I change the retransmission interval to 30ms on a Mellanox NIC? Ethernet Adapter Cards	2	59	January 10, 2025
RoCE dramatic loss in throughput. Software And Drivers infiniband , adapters-and-cables , roce	1	527	May 12, 2020

Intermittent timeout when modifying QP to RTR with RoCE

Related topics