Leading MLPerf Inference v3.1 Results with NVIDIA GH200 Grace Hopper Superchip Debut

Originally published at: https://developer.nvidia.com/blog/leading-mlperf-inference-v3-1-results-gh200-grace-hopper-superchip-debut/

AI is transforming computing, and inference is how the capabilities of AI are deployed in the world’s applications. Intelligent chatbots, image and video synthesis from simple text prompts, personalized content recommendations, and medical imaging are just a few examples of AI-powered applications. Inference workloads are both computationally demanding and diverse, requiring that platforms be able…

Hi! I’m trying to run MLPerf on an A100 GPU in MIG mode. I followed the documentation and have some questions:

  1. I want to run these benchmarks on MIG slices as described on this page, but I could not find scripts/launch_heterogeneous_mig.py in the repo.

  2. This implementation uses a benchmark configuration for each benchmark, for example the Server scenario of resnet50. Is there a document that describes each field in more detail? It would help me understand the meaning of every field and tune it for my experiments.
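For context on point 1, here is what I am trying to do: run one benchmark process per MIG slice. A minimal sketch of how I plan to enumerate the MIG device UUIDs (the sample `nvidia-smi -L` output below is illustrative, not captured from my machine), so that each benchmark process can be pinned to one slice via `CUDA_VISIBLE_DEVICES`:

```python
import re

# Illustrative output of `nvidia-smi -L` on a MIG-enabled A100
# (UUIDs are placeholders, not real device identifiers):
sample_listing = """\
GPU 0: NVIDIA A100-SXM4-40GB (UUID: GPU-xxxxxxxx)
  MIG 1g.5gb Device 0: (UUID: MIG-aaaa-bbbb)
  MIG 1g.5gb Device 1: (UUID: MIG-cccc-dddd)
"""

def mig_uuids(listing: str):
    """Extract MIG device UUIDs from an `nvidia-smi -L` listing."""
    return re.findall(r"UUID:\s*(MIG-[0-9A-Za-z-]+)", listing)

# Each benchmark process would then be launched with
# CUDA_VISIBLE_DEVICES=<one MIG UUID> to target a single slice.
for uuid in mig_uuids(sample_listing):
    print(uuid)
```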
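To clarify point 2, the kind of configuration I mean looks roughly like the sketch below. The field names are my guesses at what such a per-benchmark config typically contains, and the values are placeholders, not taken from the actual repo:

```python
# Hypothetical per-benchmark config for resnet50 in the Server scenario.
# All field names and values here are illustrative assumptions.
resnet50_server_config = {
    "gpu_batch_size": 64,        # inference batch size per GPU
    "server_target_qps": 30000,  # target queries/sec for the Server scenario
    "input_dtype": "int8",       # precision of the network inputs
    "use_graphs": True,          # whether to use CUDA graph capture
}
```

A field-by-field reference would tell me which of these are safe to adjust when benchmarking on a single MIG slice rather than a full GPU.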