I have a problem with Ubuntu 14.04.6 LTS and a ConnectX-4 Ethernet Card. We've tried with various MLNX_OFED_Linux_4 and 5 and we cannot see the ports, can you help me please?

Distributor ID: Ubuntu

Description: Ubuntu 14.04.6 LTS

Release: 14.04

Codename: trusty

He probado con dos versiones de drivers:

mlnx-en-4.9-3.1.5.0-ubuntu14.04-x86_64.iso

MLNX_OFED_LINUX-5.3-1.0.0.1-ubuntu14.04-x86_64.iso

Y en la instalación de cada driver verificar que el firmware sea e ultimo disponible si no lo tiene instalado lo instala por lo que el de acuerdo a esto el firmware es el último.

Reviso con lspci y obtengo esto:

01:00.0 Ethernet controller: Mellanox Technologies MT27710 Family [ConnectX-4 Lx]

01:00.1 Ethernet controller: Mellanox Technologies MT27710 Family [ConnectX-4 Lx]

De acuerdo al manual una vez instalado los driver debe reiniciar el servicio para ver los puertos:

arca@optiplex-7040:~/Downloads$ sudo /etc/init.d/mlnx-en.d restart

[sudo] password for arca:

Unloading NIC driver: [ OK ]

Loading NIC driver: [ OK ]

La instalación en ambos casos termina sin errores pero aún no tengo disponible los puertos, solamente los puertos ya existentes están disponibles:

1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN mode DEFAULT group default qlen 1

link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00

2: eth0: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc mq state DOWN mode DEFAULT group default qlen 1000

link/ether 90:e2:ba:b8:7b:d0 brd ff:ff:ff:ff:ff:ff

3: eth1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP mode DEFAULT group default qlen 1000

link/ether 48:4d:7e:b0:02:83 brd ff:ff:ff:ff:ff:ff

4: eth2: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc mq state DOWN mode DEFAULT group default qlen 1000

link/ether 90:e2:ba:b8:7b:d1 brd ff:ff:ff:ff:ff:ff

Algo que llama mi atención es que en los registros de sistema cuando reinicio el servicio tengo los siguientes mensajes :

May 10 11:10:08 optiplex-7040 kernel: [153550.957504] Compat-mlnx-ofed backport release: 7e619ca

May 10 11:10:08 optiplex-7040 kernel: [153550.957507] Backport based on mlnx_ofed/mlnx-ofa_kernel-4.0.git 7e619ca

May 10 11:10:08 optiplex-7040 kernel: [153550.957508] compat.git: mlnx_ofed/mlnx-ofa_kernel-4.0.git

May 10 11:10:08 optiplex-7040 kernel: [153550.978037] mlx5_core 0000:01:00.0: Missing registers BAR, aborting

May 10 11:10:08 optiplex-7040 kernel: [153550.978041] mlx5_core 0000:01:00.0: mlx5_pci_init:1055:(pid 25366): error requesting BARs, aborting

May 10 11:10:08 optiplex-7040 kernel: [153550.978080] mlx5_core 0000:01:00.0: init_one:2142:(pid 25366): mlx5_pci_init failed with error code -19

May 10 11:10:08 optiplex-7040 kernel: [153550.978334] mlx5_core 0000:01:00.1: Missing registers BAR, aborting

May 10 11:10:08 optiplex-7040 kernel: [153550.978338] mlx5_core 0000:01:00.1: mlx5_pci_init:1055:(pid 25366): error requesting BARs, aborting

May 10 11:10:08 optiplex-7040 kernel: [153550.978375] mlx5_core 0000:01:00.1: init_one:2142:(pid 25366): mlx5_pci_init failed with error code -19

-Javier

Hello Javier,

Thank you for posting your inquiry on the NVIDIA Networking Community.

Based on the information provided, the issue you are facing is not related to the adapter. It is a system BIOS related issue. The system BIOS was not able to setup the BAR address on our devices and as a result the driver failed at startup.

Our recommendation is to update the system BIOS to latest version available, reset all BIOS settings to system default.

If the issue still occurs after these recommendation, please contact the system vendor.

Thank you and regards,

~NVIDIA Networking Technical Support