QUERY_FW command failed, err=-5, aborting when rebooting machine

Getting this error when trying to get a Connect EN card working on RHEL 6.3 64 bit. Have install the 2.4 drivers

[root@XXXX ~]# cat /var/log/dmesg | grep ml

Compat-mlnx-ofed backport release: 6a710e6

Backport based on mlnx_ofed/mlnx-ofed-2.x-kernel.git 6a710e6

compat.git: mlnx_ofed/mlnx-ofed-2.x-kernel.git

mlx4_core: Mellanox ConnectX core driver v2.4-1.0.0.1 (Mar 10 2015)

mlx4_core: Initializing 0000:04:00.0

mlx4_core 0000:04:00.0: PCI INT A → GSI 38 (level, low) → IRQ 38

mlx4_core 0000:04:00.0: setting latency timer to 64

mlx4_core 0000:04:00.0: vpd r/w failed. This is likely a firmware bug on this device. Contact the card vendor for a firmware update.

mlx4_core 0000:04:00.0: command QUERY_FW (0x4) timed out (go bit not cleared)

mlx4_core 0000:04:00.0: mlx4_enter_error_state: device is going to be reset

mlx4_core 0000:04:00.0: mlx4_enter_error_state: device was reset successfully

mlx4_core 0000:04:00.0: mlx4_enter_error_state: end

mlx4_core 0000:04:00.0: QUERY_FW command failed, err=-5, aborting.

mlx4_core 0000:04:00.0: Failed to init fw, aborting.

mlx4_core 0000:04:00.0: PCI INT A disabled

mlx4_core: probe of 0000:04:00.0 failed with error -5

Here is the firmware information about the card:

more /tmp/install-mlx4_en.log.3695_fw_update.log

CMD: /tmp/sfxter_ZuuLy6/mlxfwmanager_sriov_dis_x86_64/mlxfwmanager_pci -u --log-

on-update --ssl-certificate /tmp/sfxter_ZuuLy6/mlxfwmanager_sriov_dis_x86_64/ca-

bundle.crt --current-dir /tmp/10g/mlnx-en-2.4-1.0.0.1/ -L /tmp/install-mlx4_en.

log.3695_fw_update.log -y

Querying Mellanox devices firmware …

Device #1:


Device Type: ConnectX2

Part Number: MNPA19_A1-A2

Description: ConnectX-2 Lx EN network interface card; single-port SFP+; P

CIe2.0 5.0GT/s; mem-free; RoHS R6

PSID: MT_0F60110010

PCI Device Name: 0000:04:00.0

Port1 MAC: 0002c94ee93c

Port2 MAC: 0002c94ee93d

Versions: Current Available

FW 2.9.1000 2.9.1000

PXE 3.3.0400 N/A

Status: Up to date

When I run lspci I get the information below:

04:00.0 Ethernet controller: Mellanox Technologies MT26448 [ConnectX EN 10GigE, PCIe 2.0 5GT/s] (rev b0)

Any help you can provide would be appreciated.

Hi,

It looks like the Firmware is stuck, can you try to re-flash your HCA with the same Firmware and check ?

http://www.mellanox.com/downloads/firmware/fw-ConnectX2-rel-2_9_1200-MNPA19_A1-A3-FlexBoot-3.3.400.bin.zip http://www.mellanox.com/downloads/firmware/fw-ConnectX2-rel-2_9_1200-MNPA19_A1-A3-FlexBoot-3.3.400.bin.zip

mst start

flint -d /dev/mst/mtxxxx_pciconf0 -i image b

Reboot

Have you possibly previously burned this card with --allow-psid-change flag ?

I actually got around it by using an older version of the drivers. I used

the 1.5.10 version. I think it was the fact that the card I have is not

supported in the latest driver releases.