We have two new Dell servers (R740 with ConnectX-5 MT28800 Dual port adapter) and (R640 with ConnectX-4 MT27700 Dual port adapter) both using 1 x Dell Q28-100G-LR4 optics pr. adapter.
No matter what I do, I am unable to get link using a Corning OS2 cable.
Both adapters are set to Ethernet and all Dell firmware has been updated on the servers.
We are not using the Mellanox nor the Dell drivers, but the inbox drivers in CentOS.
Everytime I plug-in the QSFP, this message is listed in dmesg:
[581258.513322] mlx5_core 0000:3b:00.0: Port module event[error]: module 0, Cable error, Power budget exceeded
On the ConnectX-5 card the following parameters are set pr. default using the inbox drivers:
mstconfig -d 3b:00.0 q|grep “POWER”
DISABLE_SLOT_POWER_LIMITER True(1)
ADVANCED_POWER_SETTINGS True(1)
On the ConnectX-4 cards the settings are not present, and therefore not set.
lspci shows there is plenty of power on the PCIe slot, 75W. The QSFP requires 3.5W max, hence it should have allot of power available.
We are running the following firmware on the adapters:
FW Version: 16.25.4062
FW Release Date: 5.6.2019
Part Number: 09FTMY_071C1T_Ax
Description: Mellanox ConnectX-5 Ex Dual Port 100 GbE QSFP Network Adapter
Product Version: 16.25.4062
Rom Info: type=UEFI version=14.18.19 cpu=AMD64
type=PXE version=3.5.701 cpu=AMD64
FW Version: 12.25.1020
FW Release Date: 30.4.2019
Part Number: 0068F2_0NNJ2M_Ax
Description: Mellanox ConnectX-4 Dual Port EDR PCIE Adapter LP
Product Version: 12.25.1020
Rom Info: type=PXE version=3.5.701 cpu=AMD64
The servers are currently connected back-2-back, and still no connection. What can be causing the connection issue?
Since the problem is seen as the QSFP modules are plugged into the NIC. it seems to be power issues with PCIe?
Can someone help with this?