Thank you Davide,
you were right, Connectx5 relaxed mode was disabled,
I enabled it with:
sudo mlxconfig -d mlx5_0 set PCI_WR_ORDERING=1 ; then reboot
nv_rsync is loaded early at boot by service script right after nvidia-tesla:
nv_peer_mem 8513 0
nvidia_uvm 1107130 0
nvidia_modeset 1342589 0
nv_rsync_mem 16446 1
nvidia 21702034 31 nvidia_uvm,nv_peer_mem,nv_rsync_mem,nvidia_modeset
but once enabled, it does not change the BW…
IBMNPU
Relaxed Ordering Mode : Enabled
PCI
Bus : 0x04
Device : 0x00
Domain : 0x0004
Device Id : 0x1DB510DE
GPU0 and mlx5_0 are on the same Power9/PCIe
our Cnx5 hw has a different ID than yours (i checked that sniffer is off)
0000:01:00.0 Ethernet controller: Mellanox Technologies MT28800 Family [ConnectX-5 Ex]
0000:01:00.1 Ethernet controller: Mellanox Technologies MT28800 Family [ConnectX-5 Ex]
0030:01:00.0 Ethernet controller: Mellanox Technologies MT27710 Family [ConnectX-4 Lx]
0030:01:00.1 Ethernet controller: Mellanox Technologies MT27710 Family [ConnectX-4 Lx]
hca_id: mlx5_0
transport: InfiniBand (0)
fw_ver: 16.24.8000
node_guid: 9803:9b03:0005:2816
sys_image_guid: 9803:9b03:0005:2816
vendor_id: 0x02c9
vendor_part_id: 4121
hw_ver: 0x0
board_id: IBM0000000020
We have MLNX_OFED_LINUX-4.7-3.2.9.0-ubuntu18.04 on a debian system. No issue during installation.
ii ibverbs-utils 41mlnx1-OFED.4.7.0.0.2.47329 ppc64el Examples for the libibverbs library
ii libibverbs-dev 41mlnx1-OFED.4.7.0.0.2.47329 ppc64el Development files for the libibverbs library
ii libibverbs1 41mlnx1-OFED.4.7.0.0.2.47329 ppc64el Library for direct userspace use of RDMA (InfiniBand/iWARP)
ii libmlx5-1 41mlnx1-OFED.4.7.0.3.3.47329 ppc64el Userspace driver for Mellanox ConnectX InfiniBand HCAs
ii libmlx5-dev 41mlnx1-OFED.4.7.0.3.3.47329 ppc64el Development files for the libmlx5 driver
ii librdmacm-dev 41mlnx1-OFED.4.7.3.0.6.47329 ppc64el Development files for the librdmacm library
ii librdmacm1 41mlnx1-OFED.4.7.3.0.6.47329 ppc64el Userspace RDMA Connection Manager
ii mlnx-ofed-kernel-dkms 4.7-OFED.4.7.3.2.9.1.g457f064 all DKMS support for mlnx-ofed kernel modules
ii mlnx-ofed-kernel-utils 4.7-OFED.4.7.3.2.9.1.g457f064 ppc64el Userspace tools to restart and tune mlnx-ofed kernel modules
Linux scisoft15 4.19.0-6-powerpc64le #1 SMP Debian 4.19.67-2+deb10u2 (2019-11-11) ppc64le GNU/Linux
nv_rsync_mem from spectrum MPI.
something strange reported by nvidia-smi on Tesla PCIe PCIe width not set at x16:
IBMNPU
Relaxed Ordering Mode : Enabled
PCI
Bus : 0x04
Device : 0x00
Domain : 0x0004
Device Id : 0x1DB510DE
Bus Id : 00000004:04:00.0
Sub System Id : 0x124910DE
GPU Link Info
PCIe Generation
Max : 3
Current : 3
Link Width
Max : 16x
Current : 2x
lspci report the same:
LnkSta: Speed 8GT/s, Width x2, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
but it would not be consistent with the observed BW to GPU by a factor of 2, neither with BW to CPU memory (97%) ???