In the kernel log, CQ errors occurred in the mlx5_core driver

Hi Experts.

I am using ConnectX-5 and ConnectX-6 NICs on a Rocky Linux 9.3 system.

After installing OFED, I used dpdk-testpmd to test the performance of these NICs.

However, when I checked the system log, I found errors like the ones shown below.

Using OS : Rocky OS 9.3
Kernel version : 5.14.0-362.8.1.el9_3.x86_64
OFED version : MLNX_OFED_LINUX-23.10-0.5.5.0-rhel9.2-x86_64
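The version strings above show an OFED package named for rhel9.2 installed on an el9_3 kernel. As a minimal Python sketch (the function names are hypothetical, and the patterns are assumptions based only on the two strings in this post), the two release numbers can be extracted and compared. A difference between them does not by itself explain the CQ errors; it is only a quick way to see what the package was built for.

```python
import re

def ofed_target_release(ofed_name):
    """Return the (major, minor) RHEL target embedded in an OFED package name, or None."""
    m = re.search(r"-rhel(\d+)\.(\d+)-", ofed_name)
    return (int(m.group(1)), int(m.group(2))) if m else None

def kernel_el_release(kernel_release):
    """Return the (major, minor) from the elX_Y tag of a kernel release string, or None."""
    m = re.search(r"\.el(\d+)_(\d+)\.", kernel_release)
    return (int(m.group(1)), int(m.group(2))) if m else None

# Strings copied from this post.
ofed = "MLNX_OFED_LINUX-23.10-0.5.5.0-rhel9.2-x86_64"
kernel = "5.14.0-362.8.1.el9_3.x86_64"

ofed_rel = ofed_target_release(ofed)
kernel_rel = kernel_el_release(kernel)
print("OFED target:", ofed_rel, "kernel:", kernel_rel, "match:", ofed_rel == kernel_rel)
```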

#####mlx5_core driver info#######
[root@localhost home]# modinfo mlx5_core
filename: /lib/modules/5.14.0-362.8.1.el9_3.x86_64/extra/mlnx-ofa_kernel/drivers/net/ethernet/mellanox/mlx5/core/mlx5_core.ko
alias: auxiliary:mlx5_core.eth-rep
alias: auxiliary:mlx5_core.eth
basedon: Korg 6.3-rc3
version: 23.10-0.5.5
license: Dual BSD/GPL
description: Mellanox 5th generation network adapters (ConnectX series) core driver
author: Eli Cohen eli@mellanox.com
rhelversion: 9.3
srcversion: 8904317AB68043A6706E21E
alias: pci:v000015B3d0000A2DFsv*sd*bc*sc*i*
alias: pci:v000015B3d0000A2DCsv*sd*bc*sc*i*
alias: pci:v000015B3d0000A2D6sv*sd*bc*sc*i*
alias: pci:v000015B3d0000A2D3sv*sd*bc*sc*i*
alias: pci:v000015B3d0000A2D2sv*sd*bc*sc*i*
alias: pci:v000015B3d00001023sv*sd*bc*sc*i*
alias: pci:v000015B3d00001021sv*sd*bc*sc*i*
alias: pci:v000015B3d0000101Fsv*sd*bc*sc*i*
alias: pci:v000015B3d0000101Esv*sd*bc*sc*i*
alias: pci:v000015B3d0000101Dsv*sd*bc*sc*i*
alias: pci:v000015B3d0000101Csv*sd*bc*sc*i*
alias: pci:v000015B3d0000101Bsv*sd*bc*sc*i*
alias: pci:v000015B3d0000101Asv*sd*bc*sc*i*
alias: pci:v000015B3d00001019sv*sd*bc*sc*i*
alias: pci:v000015B3d00001018sv*sd*bc*sc*i*
alias: pci:v000015B3d00001017sv*sd*bc*sc*i*
alias: pci:v000015B3d00001016sv*sd*bc*sc*i*
alias: pci:v000015B3d00001015sv*sd*bc*sc*i*
alias: pci:v000015B3d00001014sv*sd*bc*sc*i*
alias: pci:v000015B3d00001013sv*sd*bc*sc*i*
alias: auxiliary:mlx5_core.sf
depends: tls,mlxdevm,mlx_compat,mlxfw,pci-hyperv-intf,psample
retpoline: Y
name: mlx5_core
vermagic: 5.14.0-362.8.1.el9_3.x86_64 SMP preempt mod_unload modversions
parm: num_of_groups:Eswitch offloads number of big groups in FDB table. Valid range 1 - 1024. Default 15 (uint)
parm: debug_mask:debug mask: 1 = dump cmd data, 2 = dump cmd exec time, 3 = both. Default=0 (uint)
parm: prof_sel:profile selector. Valid range 0 - 3 (uint)
parm: probe_vf:probe VFs or not, 0 = not probe, 1 = probe. Default = 1 (bool)

####### System Logs ######################

[Fri Dec 22 02:50:22 2023] mlx5_core 0000:99:00.0: cq_err_event_notifier:551:(pid 4501): CQ error on CQN 0x580, syndrome 0x1
[Fri Dec 22 02:50:54 2023] mlx5_core 0000:99:00.0: cq_err_event_notifier:551:(pid 4421): CQ error on CQN 0x587, syndrome 0x1
[Fri Dec 22 02:51:08 2023] mlx5_core 0000:99:00.0: cq_err_event_notifier:551:(pid 4423): CQ error on CQN 0x583, syndrome 0x1
[Fri Dec 22 02:51:43 2023] mlx5_core 0000:99:00.0: cq_err_event_notifier:551:(pid 4477): CQ error on CQN 0x582, syndrome 0x1
[Fri Dec 22 02:52:12 2023] mlx5_core 0000:99:00.0: cq_err_event_notifier:551:(pid 4427): CQ error on CQN 0x588, syndrome 0x1
[Fri Dec 22 02:52:29 2023] mlx5_core 0000:99:00.0: cq_err_event_notifier:551:(pid 4423): CQ error on CQN 0x57f, syndrome 0x1
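The log lines above all follow the same layout, so they can be summarized with a short script. The following is a minimal Python sketch, assuming the line format is exactly what is shown in this post; the regular expression and the `parse_cq_errors` helper are hypothetical names written for illustration and are not part of any NVIDIA tool. It extracts the PCI address, pid, CQN, and syndrome from each line and counts events per pid.

```python
import re
from collections import Counter

# Pattern for the cq_err_event_notifier lines as they appear in this post.
CQ_ERR_RE = re.compile(
    r"mlx5_core (?P<pci>[0-9a-fA-F:.]+): cq_err_event_notifier:\d+:"
    r"\(pid (?P<pid>\d+)\): CQ error on CQN (?P<cqn>0x[0-9a-fA-F]+), "
    r"syndrome (?P<syndrome>0x[0-9a-fA-F]+)"
)

def parse_cq_errors(lines):
    """Return one dict per CQ error line; lines that do not match are skipped."""
    events = []
    for line in lines:
        m = CQ_ERR_RE.search(line)
        if m:
            events.append({
                "pci": m["pci"],
                "pid": int(m["pid"]),
                "cqn": int(m["cqn"], 16),
                "syndrome": int(m["syndrome"], 16),
            })
    return events

# The six lines from the system log above.
SAMPLE = """\
[Fri Dec 22 02:50:22 2023] mlx5_core 0000:99:00.0: cq_err_event_notifier:551:(pid 4501): CQ error on CQN 0x580, syndrome 0x1
[Fri Dec 22 02:50:54 2023] mlx5_core 0000:99:00.0: cq_err_event_notifier:551:(pid 4421): CQ error on CQN 0x587, syndrome 0x1
[Fri Dec 22 02:51:08 2023] mlx5_core 0000:99:00.0: cq_err_event_notifier:551:(pid 4423): CQ error on CQN 0x583, syndrome 0x1
[Fri Dec 22 02:51:43 2023] mlx5_core 0000:99:00.0: cq_err_event_notifier:551:(pid 4477): CQ error on CQN 0x582, syndrome 0x1
[Fri Dec 22 02:52:12 2023] mlx5_core 0000:99:00.0: cq_err_event_notifier:551:(pid 4427): CQ error on CQN 0x588, syndrome 0x1
[Fri Dec 22 02:52:29 2023] mlx5_core 0000:99:00.0: cq_err_event_notifier:551:(pid 4423): CQ error on CQN 0x57f, syndrome 0x1
"""

events = parse_cq_errors(SAMPLE.splitlines())
by_pid = Counter(e["pid"] for e in events)
devices = {e["pci"] for e in events}

print("events:", len(events))
print("devices:", sorted(devices))
print("CQNs:", [hex(e["cqn"]) for e in events])
print("events per pid:", dict(by_pid))
```

On a live system the same helper could be fed the output of `dmesg` instead of the pasted sample.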

Why do these CQ errors occur, and how can I fix them?

Any advice would be appreciated.

Thanks.

Hello @goo1047,

Thank you for posting your query on our community. Please note that Rocky 9.3 is not a supported OS. Please refer to the list of supported operating systems in the MLNX_OFED release notes at the link below:
https://docs.nvidia.com/networking/display/mlnxofedv23100550/general+support

Thanks,
Bhargavi