Updated CentOS 7.9.2009 kernel using elrepo-kernel. mlnxofedinstall --add-kernel-support failed

I updated CentOS 7.9.2009 kernel using elrepo-kernel.

When I then ran mlnxofedinstall --add-kernel-support, it failed.

I installed both the mainline kernel and the mainline-devel kernel package from elrepo-kernel.
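
A quick way to confirm that the devel package matches the running kernel is to check that /lib/modules/<release>/build exists, since that is the path the installer passes as --kernel-sources. A minimal sketch, assuming a POSIX shell; the function name has_kernel_build_dir and the MODULES_ROOT override are hypothetical helpers for illustration, not part of MLNX_OFED:

```shell
#!/bin/sh
# MODULES_ROOT is a hypothetical override that makes the check testable;
# on a real system it stays at /lib/modules.
MODULES_ROOT="${MODULES_ROOT:-/lib/modules}"

# Succeeds when kernel build files are present for the given release.
# "build" is usually a symlink created by the kernel devel package;
# [ -d ] follows the symlink, so a dangling link counts as missing.
has_kernel_build_dir() {
    release="$1"
    [ -d "$MODULES_ROOT/$release/build" ]
}
```

Typical use would be: has_kernel_build_dir "$(uname -r)" || echo "no build directory for running kernel". A passing check only shows the sources are present, not that MLNX_OFED can compile against them.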

uname -r

5.15.11-1.el7.elrepo.x86_64

ofed_info -n

5.5-1.0.3.2

./mlnxofedinstall --add-kernel-support

Note: This program will create MLNX_OFED_LINUX TGZ for rhel7.9 under /tmp/MLNX_OFED_LINUX-5.5-1.0.3.2-5.15.11-1.el7.elrepo.x86_64 directory.

See log file /tmp/MLNX_OFED_LINUX-5.5-1.0.3.2-5.15.11-1.el7.elrepo.x86_64/mlnx_iso.24347_logs/mlnx_ofed_iso.24347.log

Checking if all needed packages are installed…

Building MLNX_OFED_LINUX RPMS . Please wait…

ERROR: Failed executing "MLNX_OFED_SRC-5.5-1.0.3.2/install.pl --tmpdir /tmp/MLNX_OFED_LINUX-5.5-1.0.3.2-5.15.11-1.el7.elrepo.x86_64/mlnx_iso.24347_logs --kernel-only --kernel 5.15.11-1.el7.elrepo.x86_64 --kernel-sources /lib/modules/5.15.11-1.el7.elrepo.x86_64/build --builddir /tmp/MLNX_OFED_LINUX-5.5-1.0.3.2-5.15.11-1.el7.elrepo.x86_64/mlnx_iso.24347 --disable-kmp --build-only --distro rhel7.9"

ERROR: See /tmp/MLNX_OFED_LINUX-5.5-1.0.3.2-5.15.11-1.el7.elrepo.x86_64/mlnx_iso.24347_logs/mlnx_ofed_iso.24347.log

Failed to build MLNX_OFED_LINUX for 5.15.11-1.el7.elrepo.x86_64
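
The installer output above points at a log file, and the first error recorded there is usually more informative than the final "Failed to build" line. A minimal sketch for pulling out the first few matches, assuming GNU grep (for the -m option); the helper name first_errors and the 'error|fatal' pattern are illustrative assumptions, not something MLNX_OFED provides:

```shell
#!/bin/sh
# Print the first matching lines (with line numbers) from a build log.
# $1: path to the log file
# $2: maximum number of matches to print (defaults to 10)
first_errors() {
    log="$1"
    max="${2:-10}"
    # -n: prefix line numbers, -i: case-insensitive, -m: stop after $max matches
    grep -n -i -m "$max" -E 'error|fatal' "$log"
}
```

For the run shown here, that would be: first_errors /tmp/MLNX_OFED_LINUX-5.5-1.0.3.2-5.15.11-1.el7.elrepo.x86_64/mlnx_iso.24347_logs/mlnx_ofed_iso.24347.log. The same helper can be pointed at the rpmbuild log referenced from that file.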

systemctl status openibd

● openibd.service - openibd - configure Mellanox devices

Loaded: loaded (/usr/lib/systemd/system/openibd.service; enabled; vendor preset: disabled)

Active: failed (Result: exit-code) since Fri 2021-12-24 01:40:37 EST; 34min ago

Docs: file:/etc/infiniband/openib.conf

Process: 1076 ExecStart=/etc/init.d/openibd start bootid=%b (code=exited, status=3)

Main PID: 1076 (code=exited, status=3)

Dec 24 01:40:36 aes18 openibd[1076]: [93B blob data]

Dec 24 01:40:36 aes18 openibd[1076]: [95B blob data]

Dec 24 01:40:36 aes18 openibd[1076]: [94B blob data]

Dec 24 01:40:36 aes18 openibd[1076]: [49B blob data]

Dec 24 01:40:36 aes18 openibd[1076]: [93B blob data]

Dec 24 01:40:37 aes18 openibd[1076]: [94B blob data]

Dec 24 01:40:37 aes18 systemd[1]: openibd.service: main process exited, code=exited, status=3/NOTIMPLEMENTED

Dec 24 01:40:37 aes18 systemd[1]: Failed to start openibd - configure Mellanox devices.

Dec 24 01:40:37 aes18 systemd[1]: Unit openibd.service entered failed state.

Dec 24 01:40:37 aes18 systemd[1]: openibd.service failed.
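
The "[blob data]" entries hide the actual openibd messages, typically because they contain non-printable bytes such as terminal color codes. The --all option of journalctl prints such fields in full. A small wrapper as a sketch; the function name show_unit_log is a hypothetical helper, while the journalctl options themselves are standard:

```shell
#!/bin/sh
# Show the journal of one systemd unit for the current boot,
# printing every field in full instead of "[blob data]".
# $1: unit name, for example openibd
show_unit_log() {
    unit="$1"
    # -u: filter by unit, -b: current boot only,
    # --no-pager: write straight to stdout, --all: show unprintable/long fields
    journalctl -u "$unit" -b --no-pager --all
}
```

Calling show_unit_log openibd should then reveal the text behind the six blob lines, which may include raw escape sequences.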

I’ve attached the log files; hopefully they will be of help.

Your help is greatly appreciated.

Thank you.

mlnx_ofed_iso.24347.log (3.07 KB)

mlnx-ofa_kernel-5.5.rpmbuild.log (322 KB)

Hi Chan

  • Your key system info:

CentOS 7.9.2009

#uname -r

5.15.11-1.el7.elrepo.x86_64

#ofed_info -n

5.5-1.0.3.2

Unfortunately, there are kernel compatibility issues when using Linux kernel 5.15.x.

NVIDIA supports only the GA kernel version (5.15). Any later 5.15.x release (x > 0) is not officially supported.

According to the OFED release notes: https://docs.mellanox.com/display/MLNXOFEDv551032/General+Support

For this OFED version, NVIDIA supports only CentOS 7.9 with kernel 3.10.0-1160.el7.x86_64, or kernel 5.15 GA.

Support for the changes introduced in Linux kernel 5.15.x is planned for a future OFED release that will support kernel 5.16 GA.
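
The support rule described above can be expressed as a small pre-flight check to run before mlnxofedinstall --add-kernel-support. This is only a sketch: the function name is_supported_kernel is hypothetical, the list contains just the two kernels named in this reply, and it assumes that a 5.15 GA build reports itself as 5.15.0 in uname -r.

```shell
#!/bin/sh
# Succeeds only for the kernels named as supported for MLNX_OFED 5.5-1.0.3.2
# in this thread; every other release string is treated as unsupported.
# $1: kernel release string, for example the output of `uname -r`
is_supported_kernel() {
    case "$1" in
        3.10.0-1160.el7.x86_64) return 0 ;;  # CentOS 7.9 kernel named in the reply
        5.15.0|5.15.0-*)        return 0 ;;  # assumption: 5.15 GA shows up as 5.15.0
        *)                      return 1 ;;  # e.g. 5.15.11-1.el7.elrepo.x86_64
    esac
}
```

For example: is_supported_kernel "$(uname -r)" || echo "kernel not in the supported list". The release notes linked above remain the authoritative list.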

Thanks