Hello. I’m want to use Infiniband on Jetson TX2(with Ubuntu 16.04) for sending data (6-8 GBit/s) on MCX413A-BCAT
I’m downloaded and installed MLNX_OFED_LINUX-4.5-1.0.1.0-ubuntu16.04-aarch64 package sucessful. but when try run services, i got error:
root@jetson:~# /etc/init.d/openibd start
Loading HCA driver and Access Layer: [FAILED]
and in dmesg I see:
[ 3668.345368] mlx5_0:wait_for_async_commands:659:(pid 26302): done with all pending requests
[ 3668.669829] (0000:01:00.0): E-Switch: cleanup
[ 3685.813786] Compat-mlnx-ofed backport release: b4fdfac
[ 3685.820167] Backport based on mlnx_ofed/mlnx-ofa_kernel-4.0.git b4fdfac
[ 3685.827897] compat.git: mlnx_ofed/mlnx-ofa_kernel-4.0.git
[ 3685.888572] mlx5_core 0000:01:00.0: firmware version: 12.24.1000
[ 3685.896043] mlx5_core 0000:01:00.0: 16.000 Gb/s available PCIe bandwidth, limited by 5 GT/s x4 link at 0000:00:01.0 (capable of 63.008 Gb/s with 8 GT/s x8 link)
[ 3686.185440] (0000:01:00.0): E-Switch: Total vports 1, per vport: max uc(1024) max mc(16384)
[ 3686.199990] mlx5_core 0000:01:00.0: Port module event: module 0, Cable plugged
[ 3686.212889] mlx5_core 0000:01:00.0: FW Tracer Owner
[ 3686.217414] mlx5_core 0000:01:00.0: MLX5E: StrdRq(0) RqSz(1024) StrdSz(256) RxCqeCmprss(1)
[ 3686.380081] mlx5_ib: Mellanox Connect-IB Infiniband driver v4.5-1.0.1
[ 3686.388350] mlx5_ib: Mellanox Connect-IB Infiniband driver v4.5-1.0.1
[ 3686.484087] user_mad: couldn’t register device number
[ 3686.791433] mlx5_core 0000:01:00.0 eth1: Link up
[ 3686.797921] 8021q: adding VLAN 0 to HW filter on device eth1
while researching problem, i found ib_umad module cannot be loaded and generates this error:
root@jetson:~# modprobe ib_umad
modprobe: ERROR: could not insert ‘ib_umad’: Device or resource busy
as result infiniband not works:
root@jetson:~# sminfo
ibwarn: [1614] get_abi_version: can’t read ABI version from /sys/class/infiniband_mad/abi_version (No such file or directory): is ib_umad module loaded?
ibwarn: [1614] mad_rpc_open_port: can’t open UMAD port ((null):0)
sminfo: iberror: failed: Failed to open ‘(null)’ port ‘0’
root@jetson:~# ibp
ibping ibportstate ibprintca.pl ibprintrt.pl ibprintswitch.pl
root@jetson:~# ibping 10.10.5.1
ibwarn: [1615] get_abi_version: can’t read ABI version from /sys/class/infiniband_mad/abi_version (No such file or directory): is ib_umad module loaded?
ibwarn: [1615] mad_rpc_open_port: can’t open UMAD port ((null):0)
ibping: iberror: failed: Failed to open ‘(null)’ port ‘0’
root@jetson:~#