|
Nemotron-3-Nano 30B long context retrieval fails on 4 x RTX PRO 6000 (SM120) - NVIDIA vLLM containers perform worse than community vLLM
|
|
5
|
338
|
January 10, 2026
|
|
Help on llama.cpp command line arguments and compilation settings (performance testing included)
|
|
7
|
414
|
January 9, 2026
|
|
Open-Source AI Tool Upgrades Speed Up LLM and Diffusion Models on NVIDIA RTX PCs
|
|
1
|
40
|
January 9, 2026
|
|
DGX Spark – Request AI Enterprise / NIM Entitlement Activation
|
|
8
|
381
|
January 6, 2026
|
|
What prompt processing speed can one expect above 500k ctx?
|
|
6
|
367
|
January 3, 2026
|
|
I stopped trying to make txt2kg generate triples for me
|
|
0
|
65
|
January 3, 2026
|
|
Nemotron-Parse not available
|
|
2
|
105
|
December 29, 2025
|
|
Model deployed on kubernetes is unable to use available GPU memory fully
|
|
2
|
82
|
December 26, 2025
|
|
DGX Spark, Nemotron3, and NVFP4: Getting to 65+ tps
|
|
14
|
941
|
December 22, 2025
|
|
Inferencing models from api taking very long
|
|
1
|
114
|
December 19, 2025
|
|
5M downloads on HuggingFace!
|
|
1
|
55
|
December 19, 2025
|
|
Nemotron 3 Nano 30B with llama.cpp Playbook
|
|
1
|
573
|
December 18, 2025
|
|
DGX Spark (GB10, ARM64) – Embedding NIM llama-3.2-nv-embedqa-1b-v2:1.10.0 fails with cudaErrorSymbolNotFound (onnx runtime)
|
|
2
|
115
|
December 17, 2025
|
|
Playbook for Nemotron 3
|
|
2
|
125
|
December 15, 2025
|
|
Introducing Nemotron 3: Open Models for Agentic AI
|
|
0
|
234
|
December 15, 2025
|
|
Inside NVIDIA Nemotron 3: Techniques, Tools, and Data That Make It Efficient and Accurate
|
|
0
|
80
|
December 15, 2025
|
|
Announcing new VLLM container & 3.5X increase in Gen AI Performance in just 5 weeks of Jetson AGX Thor Launch
|
|
46
|
3165
|
December 14, 2025
|
|
LLaVA-Mistral multimodal (7B & 34B)
|
|
6
|
226
|
December 13, 2025
|
|
GB10 Platform Does Not Support Nemotron-Parse Container
|
|
3
|
129
|
December 11, 2025
|
|
Llama 3.1 nemotron 70b instruct API access not working correctly
|
|
2
|
75
|
December 5, 2025
|
|
Create Your Own Bash Computer Use Agent with NVIDIA Nemotron in One Hour
|
|
14
|
298
|
December 3, 2025
|
|
Inconsistent output from nemoretriever-table-structure-v1 docker image compared to API
|
|
2
|
53
|
December 5, 2025
|
|
At NeurIPS, NVIDIA Advances Open Model Development for Digital and Physical AI
|
|
1
|
88
|
December 1, 2025
|
|
Build and Run Secure, Data-Driven AI Agents
|
|
0
|
48
|
November 24, 2025
|
|
Running nvidia/Nemotron-Nano-VL-12B-V2-NVFP4-QAD on your spark
|
|
4
|
931
|
November 24, 2025
|
|
DGX Spark txt2kg playbook discrepancies / CPU fallback questions
|
|
6
|
429
|
November 24, 2025
|
|
Building Scalable AI on Enterprise Data with NVIDIA Nemotron RAG and Microsoft SQL Server 2025
|
|
0
|
38
|
November 18, 2025
|
|
Build-a-log-analysis-multi-agent-self-corrective-rag-system-with-nvidia-nemotron/
|
|
2
|
142
|
November 14, 2025
|
|
Build a Log Analysis Multi-Agent Self-Corrective RAG System with NVIDIA Nemotron
|
|
2
|
165
|
November 10, 2025
|
|
새로운 NVIDIA Nemotron Vision, RAG, Guardrail 모델로 특화된 AI 에이전트 개발하기
|
|
1
|
36
|
November 3, 2025
|