AI Reasoning with Llama Nemotron at GTC25 | Announcements

calexiuk · March 18, 2025, 7:07pm

We hope you got a chance to watch NVIDIA CEO Jensen Huang’s keynote at GTC.

Today, NVIDIA announced NVIDIA Llama Nemotron, an open family of leading AI models that deliver exceptional reasoning capabilities, compute efficiency, and an open license for enterprise use.

The family comes in three sizes, providing developers with the right model size based on their use case, compute availability, and accuracy requirements.

Nano: 8B distilled from Llama 3.1 8B for highest accuracy on PC and edge.
Super: 49B distilled from Llama 3.3 70B for best accuracy with highest throughput on a data center GPU. This model is the focus of this post.
Ultra: 253B distilled from Llama 3.1 405B for maximum agentic accuracy on multi-GPU data center servers (coming soon).

The Llama Nemotron with reasoning models provide best-in-class accuracy across industry-standard reasoning and agentic benchmarks: GPQA Diamond, AIME 2025, MATH 500, and BFCL, as well as Arena Hard.

Topic		Replies	Views
Introducing Llama Nemotron Ultra: Peak Accuracy Meets Unmatched Efficiency Announcements nim , llama , agentic-ai , llama-nemotron	0	290	April 8, 2025
Just Released: NVIDIA Llama Nemotron Ultra as NVIDIA NIM Technical Blog nim , llama	1	188	April 10, 2025
Build Enterprise AI Agents with Advanced Open NVIDIA Llama Nemotron Reasoning Models Technical Blog llama , agentic-ai	1	108	March 18, 2025
Meet NVIDIA Llama Nemotron Nano 4B Announcements jetson , llama , llama-nemotron	0	331	May 23, 2025
Advancing Agentic AI with NVIDIA Nemotron Open Reasoning Models Technical Blog agentic-ai	1	73	June 11, 2025
NVIDIA Llama Nemotron Ultra Open Model Delivers Groundbreaking Reasoning Accuracy Technical Blog llama	1	183	April 15, 2025
Build More Accurate and Efficient AI Agents with the New NVIDIA Llama Nemotron Super v1.5 Technical Blog llama , agentic-ai	1	119	July 28, 2025
Advancing the Accuracy-Efficiency Frontier with Llama-3.1-Nemotron-51B NVIDIA Nemotron nim , llm , llama	0	120	September 23, 2024
Llama Nemotron Models Accelerate Agentic AI Workflows with Accuracy and Efficiency Technical Blog llama	1	85	January 7, 2025
Advancing the Accuracy-Efficiency Frontier with Llama-3.1-Nemotron-51B Technical Blog llama	3	146	October 24, 2024

AI Reasoning with Llama Nemotron at GTC25 | Announcements

Related topics