When I try a simple mpif90 compilation with MPI bundled with Nvidia HPC SDK 20.11, I get
/opt/nvidia-20.11/hpc_sdk/Linux_x86_64/20.11/comm_libs/openmpi/openmpi-3.1.5/bin/.bin/mpif90: error while loading shared libraries: librdmacm.so.1: cannot open shared object file: No such file or directory
In /opt/nvidia-20.11/hpc_sdk/Linux_x86_64/20.11/comm_libs/mpi/lib, ‘ll | wc’ responds with 58 lines and librdmacm* is not there.
In the installation tarball in install_components/Linux_x86_64/20.11/comm_libs/mpi/lib, ‘ll | wc’ responds with 64 lines and librdmacm* (and libibverbs*) are there.
When I copy the missing files to the lib directory of MPI from the tarball by hand, mpif90 can compile my source.
With Intel® Core™ i9-9920X CPU @ 3.50GHz (i9 Skylake-X generation), the response to mpif90 in case of incomplete lib directory was different:
/opt/nvidia-20.11/hpc_sdk/Linux_x86_64/20.11/comm_libs/openmpi/openmpi-3.1.5/lib/libopen-pal.so.40: undefined reference to rdma_get_src_port@RDMACM_1.0
/opt/nvidia-20.11/hpc_sdk/Linux_x86_64/20.11/comm_libs/openmpi/openmpi-3.1.5/lib/libopen-pal.so.40: undefined reference to rdma_get_dst_port@RDMACM_1.0
and also disappeared after completing the libraries by hand.
Did I miss something during the installation, or did I make other mistake?