Open MPI crashes during an MPI test run; is there a workaround?

We are seeing the same problem with a Mellanox card on CentOS Linux release 8.1.1911 (Core).

Mellanox card installed:

[root@client2 ~]# lspci | grep Mell
03:00.0 Ethernet controller: Mellanox Technologies MT27500 Family [ConnectX-3]
[root@client2 ~]#

Output of the mdtest run:


No OpenFabrics connection schemes reported that they were able to be
used on a specific port. As such, the openib BTL (OpenFabrics
support) will be disabled for this port.

Local host: client2
Local device: mlx4_0
Local port: 1
CPCs attempted: rdmacm, udcm

[client2:2394 :0:2394] Caught signal 11 (Segmentation fault: invalid permissions for mapped object at address 0x7fc54b7d6768)

==== backtrace ====
0 /lib64/ [0x7fc54b169bb0]
1 /lib64/ [0x7fc54b169d8a]
2 /lib64/ [0x7fc5506f955b]
3 /lib64/ [0x7fc55e453d0a]
4 /lib64/ [0x7fc55e453e0a]
5 /lib64/ [0x7fc55e457def]
6 /lib64/ [0x7fc55d8ecab7]
7 /lib64/ [0x7fc55e45765e]
8 /lib64/ [0x7fc55d0461ba]
9 /lib64/ [0x7fc55d8ecab7]
10 /lib64/ [0x7fc55d8ecb53]
11 /lib64/ [0x7fc55d046939]
12 /lib64/ [0x7fc55d04625a]
13 /usr/lib64/openmpi/lib/ [0x7fc55d2b6f05]
14 /usr/lib64/openmpi/lib/libopen-pal.s
15 /usr/lib64/openmpi/lib/ [0x7fc55d293a5a]
16 /usr/lib64/openmpi/lib/ [0x7fc55d29f3ce]
17 /usr/lib64/openmpi/lib/ [0x7fc55d29f8b2]
18 /usr/lib64/openmpi/lib/ [0x7fc55d29f915]
19 /usr/lib64/openmpi/lib/ [0x7fc55dde8494]
20 /usr/lib64/openmpi/lib/ [0x7fc55de186b2]
21 ./mdtest() [0x407f24]
22 /lib64/ [0x7fc55d7d7873]
23 ./mdtest() [0x401a8e]


Segmentation fault (core dumped)
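
One thing that might be worth trying, offered as an untested sketch and not a confirmed fix: since the warning above says the openib BTL cannot be used on mlx4_0 port 1, the run can be steered away from the InfiniBand transports so that Open MPI falls back to TCP and shared memory. This assumes the crash comes from the InfiniBand/UCX path, which the truncated backtrace does not prove. The process count `NP` and the `MDTEST` path are placeholders, not values from the run above.

```shell
#!/bin/sh
# Hypothetical workaround attempt (assumption: the segfault is in the
# InfiniBand/UCX transport path, which the backtrace above does not confirm).
# NP and MDTEST are placeholders; adjust them to the real job.
NP="${NP:-2}"
MDTEST="${MDTEST:-./mdtest}"

# --mca pml ob1      selects the ob1 point-to-point layer
# --mca btl ^openib  excludes the openib BTL, leaving the other BTLs
#                    (for example TCP and shared memory) available
CMD="mpirun -np $NP --mca pml ob1 --mca btl ^openib $MDTEST"

# Show the command that would be run.
echo "$CMD"

# Only launch when mpirun and the mdtest binary are actually present.
if command -v mpirun >/dev/null 2>&1 && [ -x "$MDTEST" ]; then
    $CMD
fi
```

If the run completes this way, that would point at the InfiniBand transport setup and not at mdtest itself, though at the cost of not using the ConnectX-3 card for RDMA.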