Apparently, OFED 23.10 doesn’t work on recent Rocky/RHEL 8.10 kernels (seen today on 4.18.0-553.22.1.el8_10.x86_64). The mlx5_ib is unable to load due to unknown symbol errors. (see below)
When can we expect support for these newer kernels from OFED? Rocky/RHEL 8.10 is listed as supported on Linux InfiniBand Drivers with version MLNX_OFED_LINUX-23.10-3.2.2.0-rhel8.10-x86_64.tgz, but it produces these errors, and I am apparently not able to build from sources either…
For example:
Oct 29 21:51:22 weka64 kernel: mlx5_ib: disagrees about version of symbol ib_umem_release
Oct 29 21:51:22 weka64 kernel: mlx5_ib: Unknown symbol ib_umem_release (err -22)
Oct 29 21:51:22 weka64 kernel: mlx5_ib: disagrees about version of symbol uverbs_get_flags32
Oct 29 21:51:22 weka64 kernel: mlx5_ib: Unknown symbol uverbs_get_flags32 (err -22)
Oct 29 21:51:22 weka64 kernel: mlx5_ib: disagrees about version of symbol uverbs_idr_class
Oct 29 21:51:22 weka64 kernel: mlx5_ib: Unknown symbol uverbs_idr_class (err -22)
Oct 29 21:51:22 weka64 kernel: mlx5_ib: disagrees about version of symbol rdma_read_gid_l2_fields
Oct 29 21:51:22 weka64 kernel: mlx5_ib: Unknown symbol rdma_read_gid_l2_fields (err -22)
Oct 29 21:51:22 weka64 kernel: mlx5_ib: disagrees about version of symbol __ib_alloc_pd
Oct 29 21:51:22 weka64 kernel: mlx5_ib: Unknown symbol __ib_alloc_pd (err -22)
Oct 29 21:51:22 weka64 kernel: mlx5_ib: disagrees about version of symbol ib_modify_qp_is_ok