Meet NVIDIA Llama Nemotron Nano 4B

๐Ÿค Meet NVIDIA Llama Nemotron Nano 4B, an open reasoning model that provides leading accuracy and compute efficiency across scientific tasks, coding, complex math, function calling, and instruction following for edge agents.

โœจ Achieves higher accuracy and 50% higher throughput than other leading open models with 8 billion parameters

๐Ÿ“— Supports hybrid reasoning, optimizing for inference cost

๐Ÿง‘โ€๐Ÿ’ป Deploy at the edge with NVIDIA Jetson and NVIDIA RTX GPUs, maximizing security, and flexibility

๐Ÿ“ฅ Now on Hugging Face: nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1 ยท Hugging Face

๐Ÿ“น How to get started with Llama Nemotron Nano 4B: https://youtu.be/HTPiUZ3kJto