Im sorry to ask again, im new to Infiniband so dont know all the tricks and hwo to make it work just yet:)
I managed to install the software as to previous article, and rebooted the blade node.
when i start again now i get this error:
Loading HCA driver and Access Layer = Failed
Please open an ssue in the http://bugs.openfabrics.org http://bugs.openfabrics.org/ and attach /tmp/ib_debug_info_log.
the debug file is a copy of dmesg and it has the following lines
mlx4_ib 80171 0ib_mad 40497 5 ib_cm,ib_sa,ib_umad,mlx4_ib,ib_mthcaib_core 69979 9 ib_cm,ib_sa,ib_uverbs,ib_umad,iw_nes,iw_cxgb3,mlx4_ib,ib_mthca,ib_madmlx4_en 97664 0mlx4_core 185193 2 mlx4_ib,mlx4_en
mlx4_core: Mellanox ConnectX core driver v1.0-mlnx_ofed1.5.3 (November 3, 2011)
mlx4_core: Initializing 0000:03:00.0
mlx4_core 0000:03:00.0: PCI INT A → GSI 48 (level, low) → IRQ 48
mlx4_core 0000:03:00.0: setting latency timer to 64
mlx4_core 0000:03:00.0: vpd r/w failed. This is likely a firmware bug on this device. Contact the card vend or for a firmware update.mlx4_core 0000:03:00.0: vpd r/w failed. This is likely a firmware bug on this device. Contact the card vend or for a firmware update.
mlx4_en: Mellanox ConnectX HCA Ethernet driver v1.5.8.3 (June 2012)
mlx4_ib: Mellanox ConnectX InfiniBand driver v1.0-mlnx_ofed1.5.3 (November 3, 2011)
Apr 28 12:32:48 dpn01 modprobe: FATAL: Error inserting ib_ipoib (/lib/modules/2.6.32-279.el6.x86_64/extra/mln x-ofa_kernel/drivers/infiniband/ulp/ipoib/ib_ipoib.ko): Unknown symbol in module, or unknown parameter (see d mesg)Apr 28 12:44:44 dpn01 modprobe: FATAL: Error inserting ib_ipoib (/lib/modules/2.6.32-279.el6.x86_64/extra/mln x-ofa_kernel/drivers/infiniband/ulp/ipoib/ib_ipoib.ko): Unknown symbol in module, or unknown parameter (see d mesg)
Apr 28 12:48:06 dpn01 root[4494]: Set node_desc for mlx4_0: dpn01 HCA-1
root 15022 0 12:29 ? 00:00:00 [mlx4]root 15042 0 12:29 ? 00:00:00 [mlx4_opreq]root 15682 0 12:29 ? 00:00:00 [mlx4_sense]root 15772 0 12:29 ? 00:00:00 [mlx4_en]root 26512 0 12:29 ? 00:00:00 [mlx4_ib]
it looks like its loading the HCA ethernet drivers, but why fail on the other, is it because of the firmware lines above?
any help appreciated.