A100 PCIe HPL-AI poor performance

user7076 · January 27, 2022, 12:07pm

Hi,

We’ve recently purchased a generic server with 6 A100 PCIe cards and a dedicated HGX-A100.
The HPL and HPCG benchmark results using the NGC images are within the expected range.

When the run is confined to the first NVLink group and socket, the HPL-AI results are as follows:

For 4 A100 PCIe, the performance is ~100 TFLOPS
For 4 A100 SXM4, the performance is ~400 TFLOPS

I think the PCIe version is grossly underperforming, so:

What is the reason for such a large difference?
How can I debug this issue and improve the performance of the PCIe version for AI applications?

Thanks.

TomNVIDIA · January 27, 2022, 5:18pm

Hello,

This is not a DGX issue, so I am going to move your topic over to the GPU Hardware category for better visibility.

Cheers,
Tom K
