The DOCA install adds HCOLL, UCX, and SHARP, and also compiles an openmpi binary.
HPC-X also does the above.
So when should I use HPC-X if DOCA already provides all of it? If I choose to use HPC-X, does that mean I effectively manage two openmpi binaries and duplicate copies of HCOLL/UCX/SHARP, etc.?
Thanks for posting your inquiry to the NVIDIA Developer Forums.
DOCA provides UCX/OpenMPI as a matter of convenience to enable basic functionality, while HPC-X is a comprehensive HPC optimization toolkit containing tuned versions specifically for MPI applications. The packages are designed to coexist, with HPC-X taking precedence for HPC workloads when explicitly loaded.
HPC-X is designed to layer on top of DOCA’s base installation. Its module system prevents conflicts by controlling LD_LIBRARY_PATH and the binary search path, so the HPC-X builds are the ones picked up once an HPC-X module is loaded. Those builds are tuned, updated versions of the same components that DOCA installs.
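If you want to confirm which stack is active in a given shell, one rough approach is to check where the tools resolve on PATH and which directories lead LD_LIBRARY_PATH. The following is only a minimal sketch, not an NVIDIA-provided tool: the helper names `resolve_tools` and `library_dirs` are made up for illustration, and it assumes `mpirun` and `ucx_info` are the binaries you care about.

```python
import os
import shutil


def resolve_tools(names, search_path=None):
    """Map each tool name to the first match on the search path, or None.

    shutil.which walks the path entries in order, so the result shows
    which installation wins when two copies of a tool are installed.
    With search_path=None the current PATH is used.
    """
    return {name: shutil.which(name, path=search_path) for name in names}


def library_dirs(ld_library_path=None):
    """Return the non-empty LD_LIBRARY_PATH entries in lookup order."""
    if ld_library_path is None:
        ld_library_path = os.environ.get("LD_LIBRARY_PATH", "")
    return [entry for entry in ld_library_path.split(os.pathsep) if entry]


if __name__ == "__main__":
    # Tool names are assumptions; substitute whatever you actually launch.
    for tool, location in resolve_tools(["mpirun", "ucx_info"]).items():
        print(f"{tool}: {location or 'not found on PATH'}")
    for directory in library_dirs():
        print(f"library dir: {directory}")
```

Running it once before and once after loading an HPC-X module should show whether the resolved paths move from the DOCA-installed location to the HPC-X tree. Note that it only inspects environment variables and does not examine which libraries a binary actually links against at run time.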
Thanks, and best regards,
NVIDIA Enterprise Experience