We’ve been using Infiniband on Red Hat for years with no problems.
This morning suddenly the openibd service won’t start:
systemctl status openibd.service |cat
● openibd.service - openibd - configure Mellanox devices
Loaded: loaded (/usr/lib/systemd/system/openibd.service; enabled; vendor preset: disabled)
Active: failed (Result: exit-code) since Tue 2019-11-26 15:47:34 EST; 22s ago
Docs: file:/etc/infiniband/openib.conf
Main PID: 38660 (code=exited, status=3)
Nov 26 15:47:33 micc openibd[38660]: Module ib_uverbs belong to kernel-debug which is not a part of MLNX_OFED, skipping…[FAILED]
Nov 26 15:47:33 micc openibd[38660]: Module ib_ipoib belong to kernel-debug which is not a part of MLNX_OFED, skipping…[FAILED]
Nov 26 15:47:33 micc openibd[38660]: Loading HCA driver and Access Layer:[ OK ]
Nov 26 15:47:33 micc openibd[38660]: Module rdma_cm belong to kernel-debug which is not a part of MLNX_OFED, skipping…[FAILED]
Nov 26 15:47:33 micc openibd[38660]: Module ib_ucm does not exist, skipping…[FAILED]
Nov 26 15:47:34 micc openibd[38660]: Module rdma_ucm belong to kernel-debug which is not a part of MLNX_OFED, skipping…[FAILED]
Nov 26 15:47:34 micc systemd[1]: openibd.service: main process exited, code=exited, status=3/NOTIMPLEMENTED
Nov 26 15:47:34 micc systemd[1]: Failed to start openibd - configure Mellanox devices.
Nov 26 15:47:34 micc systemd[1]: Unit openibd.service entered failed state.
Nov 26 15:47:34 micc systemd[1]: openibd.service failed.
lspci|grep -i mel
05:00.0 Network controller: Mellanox Technologies MT27500 Family [ConnectX-3]
dmesg|grep -i mell
[ 9.900792] mlx4_core: Mellanox ConnectX core driver v4.0-0
Any ideas what’s wrong?