Just Released: NVIDIA Llama Nemotron Ultra as NVIDIA NIM

Originally published at: llama-3.1-nemotron-ultra-253b-v1 Model by NVIDIA | NVIDIA NIM

Try NVIDIA Llama Nemotron Ultra as an NVIDIA NIM microservice. At only 253B total parameters, it offers reasoning performance that meets or beats top open reasoning models like DeepSeek-R1 while offering considerably higher throughput due to its optimized sizing, and retaining excellent tool calling capabilities.

1 Like