ConnectX-6 Lx #4 device reports an "EQ stuck" on EQn 0x4. Attempting recovery

knights_jsh · January 7, 2025, 1:25pm

We have a 6-node Windows Server 2022 Failover cluster

We use 2x Mellanox ConnectX-6 Lx adapters in each server purely for iSCSI connection to our SAN. We’re getting the following event on 2 out of the 6 servers:

ConnectX-6 Lx #4 device reports an “EQ stuck” on EQn 0x4. Attempting recovery.

I can’t seem to link this to any particular workload or action on the servers, and we don’t notice any undesired effect. A previous thread suggests it could be related to CPU utilisation or RSS configuration.

CPU utilisation is generally low (~10-15%) with RSS configured:

MaxProcessors - 4
NumberOfReceiveQueues - 8
Profile - NUMAStatic
RSS processor array is different for each interface

Any suggestions as to what could be causing this would be greatly appreciated.

abirman · January 27, 2025, 4:00pm

Hi,

Thanks for your question.
The mentioned message may be caused by CPU load, but you wrote the load is not high.

To understand the potential root cause of this behavior we will need to investigate system logs, and this will require opening of a new support case in Nvidia portal (or sending an email to enterprisesupport@nvidia.com) with all the relevant logs after the issue is observed. Then the case will be handled according to the support entitlement.

Best Regards,
Anatoly

Topic		Replies	Views
Windows S2D Cluster getting "EQ stuck" on EQn 0x4. Attempting recovery. on 3 of 5 servers Ethernet Adapter Cards	1	200	September 19, 2024
Mellanox Connectx-6 Dx adapter slow only 6gbps Throughput Ethernet Adapter Cards performance-tuning-for-mellanox-ada	0	22	March 14, 2025
No connection using connectx-6 Ethernet Adapter Cards	1	980	October 27, 2020
Getting slow speeds on Connectx-4 LX Ethernet Adapter Cards	1	1607	July 19, 2023
ConnectX4 LX: High Idle power consumption despite every requirement on Ethernet Adapter Cards	2	1467	March 20, 2024
Connext-6 lx adapter dies Ethernet Adapter Cards	2	30	February 27, 2025
ConnectX-4 Lx EN are unable to configure the port UP and Active state Ethernet Adapter Cards ethernet-adapter-cards , nics , infinibandvpi-adapter-cards	1	553	June 24, 2024
ConnectX-En 10G crash under load :-(	9	250	May 8, 2013
ConnectX-5/6 OID timeouts Ethernet Adapter Cards	9	1203	March 28, 2025
Can someone tell me what is wrong with this Mellanox 4 cards? Adapters and Cables	1	1629	October 18, 2019

ConnectX-6 Lx #4 device reports an "EQ stuck" on EQn 0x4. Attempting recovery

Related topics