I have been trying to set up a small computing cluster of 4 computers using just ConnectX-5 HCAs. We have one master computer with two cards (one dual-port and one single-port) and three slave computers, each with a single-port card. When I have just the master and one slave hooked up, it works fine, but when I start adding more slaves, the connection to the other system drops.
Do I need to have some sort of connection manager set up? Any advice on how to set this up would be greatly appreciated (or at least a pointer to the documentation would be nice).
Thanks in advance,