Concurrent INFINIBAND multicast writers

wayne.lovely · October 11, 2018, 1:59pm

Hi everyone!

I am working on a project in which I have a small set of servers with ConnectX 3 HCAs connected to an IS5030 switch.

No IP, just IB.

Given either 1 or many multicast groups, with one reader and one writer on each machine with the appropriate cpu affinity,

I observe the following behavior:

Only 1 writer in the cluster, everything else reads: the only increments in XmitWait is on the sending HCA that is just trying to get the multicast packets to the switch.

All of the IB counters on everything look great, even at many multiples of message rate compared the problem scenario below.

If I introduce just 1 more multicast writer into the mix and they are both at 5k msg/sec, XmitWait on the transmitting switch ports for the multicast group start growing. The more writers, the worse it gets.

A subnet manager is running on the switch. I have tried segregating the traffic into different VLs and turning on congestion control.

There is something about two machines generating multicast traffic to the same switch at any decent frequency.

I’m using 4k buffers but my message size is only 512 bytes.

Does anyone have any insight into what would be causing the congestion?

Topic		Replies	Views
Multicast via switch between two Windows machines does not work Mellanox OFED	0	326	August 5, 2016
How to use two SX6536 switches to build a full line speed and congestion free network? InfiniBand/VPI Switch Systems	2	393	February 3, 2016
Scalability issue for multiple clients InfiniBand/VPI Adapter Cards	1	478	October 24, 2017
Read port priority counters is 0 . InfiniBand/VPI Switch Systems iterations , bytes	5	732	November 2, 2015
Degrade throughout when one of VLs in congested status InfiniBand/VPI Switch Systems	0	917	September 6, 2022
Concurrent bandwidth test CUDA Programming and Performance	30	46400	April 27, 2012
Dropping lots of UDP packets with simple TX1 configuration Jetson TX1	15	3384	October 18, 2021
Having a little trouble with mutex/synchronisation CUDA Programming and Performance	29	57516	June 5, 2007
Ib_write_bw on cx5 interface fails Software And Drivers	4	3471	April 12, 2019
New to infiniband, can't get a working connection.	22	2261	September 9, 2013

Concurrent INFINIBAND multicast writers

Related topics