HPCG for Nvidia Tesla platform

drmichaelt · October 9, 2015, 11:15pm

Hello, I was wondering if there is a Nvidia library optimized High-Performance Conjugate-Gradient (HPCG) implementation available for downloading. Target is a Haswell cluster of K80 GPUs.

Thanks!
Michael

mfatica · October 11, 2015, 7:30pm

The binary is available at http://www.hpcg-benchmark.org/software/index.html

drmichaelt · October 13, 2015, 3:37pm

Thanks for the pointer drM … Does it adapt its run-time behavior based on the total number of available GPUs per node?

Best regards
Michael

mfatica · October 14, 2015, 2:15pm

Yes, you schedule one MPI rank per GPU and the code automatically assigns one.

For example, to run on two nodes with 4 GPUs each, a similar command should be specified:
mpirun -np 8 ./xhpcg_cuda6.5_openmpi

dribbroc · December 2, 2015, 10:27am

Does there exist a version for the tegra k1 board, too?
Unfortunately, the binary x86 version is not compatible with this arm board :(

mfatica · December 3, 2015, 3:23pm

There is no public binary for ARM.

dribbroc · December 11, 2015, 11:52am

Thank you for your answer.

If its not public available, is there a way to get it non-public for internal use only?