|
Sparkrun - central command with tab completion for launching inference on Spark Clusters
|
|
60
|
1527
|
April 6, 2026
|
|
Open-Source CLI Agent Framework for NVIDIA AI Endpoints - Seeking Feedback
|
|
5
|
96
|
April 6, 2026
|
|
Jetson Containers Quickstart on NVIDIA Jetson AGX Orin 64GB
|
|
0
|
23
|
April 5, 2026
|
|
Using GGUFs in two-Spark cluster, doable?
|
|
9
|
243
|
April 5, 2026
|
|
[GB10] vLLM + DeepSeek-R1-32B Stable Setup on Blackwell — Full Protocol After 4 Days of Failures
|
|
5
|
332
|
April 2, 2026
|
|
Running Mistral Small 4 119B NVFP4 on NVIDIA DGX Spark (GB10)
|
|
47
|
2380
|
April 2, 2026
|
|
50%+ Improvement on spark?!
|
|
25
|
1748
|
March 24, 2026
|
|
[GB10] vLLM + DeepSeek-R1-32B on Blackwell aarch64 — 4 more failure modes (v2 protocol)
|
|
0
|
125
|
March 19, 2026
|
|
Phone Verification "Exceeded Limits" Error on First Attempt - India (+91)
|
|
7
|
216
|
March 18, 2026
|
|
How to run NVFP4/DeepSeek-R1-0528-Qwen3-8B-FP4 using eugr/spark-vllm-docker
|
|
9
|
304
|
March 16, 2026
|
|
Building Local + Hybrid LLMs on DGX Spark That Outperform Top Cloud Models
|
|
19
|
4107
|
March 15, 2026
|
|
Ubuntu Inference Snaps
|
|
0
|
82
|
March 1, 2026
|
|
Need hosted API access for nvidia/nemotron-3-nano-30b-a3b
|
|
0
|
90
|
February 22, 2026
|
|
High inference latency
|
|
5
|
277
|
March 5, 2026
|
|
Best Inference Framework & Open Models for Orchestrator-Workers Agentic Coding on GB10 + 5090 Hybrid?
|
|
1
|
447
|
February 19, 2026
|
|
NVIDIA B200: NCCL WARN Cuda failure 700 'an illegal memory access was encountered'
|
|
5
|
130
|
February 19, 2026
|
|
How to build a specific vLLM version (0.11.1) on Jetson Orin AGX (CUDA 12.6 / JetPack 6.2 r36.4.3)?
|
|
8
|
215
|
February 9, 2026
|
|
DGX Spark Completely Inoperable - Need Help (USB Boot Fails, UEFI Inaccessible, System Frozen)
|
|
7
|
374
|
January 31, 2026
|
|
GDX Spark is extremely slow on a short LLM test
|
|
21
|
3356
|
January 25, 2026
|
|
Overcoming Compute and Memory Bottlenecks with FlashAttention-4 on NVIDIA Blackwell
|
|
0
|
175
|
January 22, 2026
|
|
Deepseek-r1-distill-llama-8b 502 Bad Gateway Error
|
|
3
|
79
|
January 15, 2026
|
|
Deepseek v3.2 404 not found the function
|
|
2
|
345
|
January 9, 2026
|
|
Native tool calls fail on DeepSeek 3.2
|
|
2
|
369
|
January 9, 2026
|
|
Repeated Gateway Timeout errors with deepseek-3.2
|
|
0
|
166
|
December 29, 2025
|
|
DeepSeek V3.2 API returns 404 error today after yesterday's work
|
|
0
|
217
|
December 28, 2025
|
|
Deepseek-v3.2: Function 'xxx': Not found for account 'yyy'
|
|
0
|
135
|
December 28, 2025
|
|
Inferencing models from api taking very long
|
|
1
|
195
|
December 19, 2025
|
|
Noob, It doesn't work for some reason. need help for clarity
|
|
1
|
188
|
December 18, 2025
|
|
NVIDIA GPU-Accelerated Sirius Achieves Record-Setting ClickBench Record
|
|
0
|
70
|
December 15, 2025
|
|
Announcing new VLLM container & 3.5X increase in Gen AI Performance in just 5 weeks of Jetson AGX Thor Launch
|
|
46
|
3631
|
December 14, 2025
|