Hello.
I’m struggling with OpenMPI.
We have three linux clusters and one is relatively new.
So the versions of OpenMPI installed in each cluster are also different.
The problem is, the behavior of OpenMPI also differs between clusters.
Our program couples OpenMP and MPI.
Each node takes one process, and in every node OpenMP does multithreading.
On old cluster with 1.4.3, following command works well.
mpirun -np 6 -machinefile mf4 ./nTracerV100 Pb3_3D_26G.inp > log_Pb3_26G
It generates 6 processes on 6 nodes and on each node 16 threads are distributed to 16 cores.
But on new cluster with 1.8.3, that command doesn’t work well.
It generates all processes on one node, and each process launches all threads in one core.
Although I add -pernode command, OpenMP still launches 16 threads in one core only.
Since I’m using exactly same program and command, it seems that this problem is coming from version difference.
Can anyone give some advices on what I should check for?
One thing I know about the difference between two versions is that default binding is set to none in older versions than 1.8.
But adding -bind-to none gives some errors like non-zero exit code.