|
DeepSeek Models - newbie python programmer - calling the wizards out there (you know who you are)
|
|
3
|
21
|
April 14, 2026
|
|
Qwen3.5-35B-A3B optimizations on single Spark
|
|
29
|
558
|
April 14, 2026
|
|
Running vLLM-Omni for Qwen3-TTS(voice design, voice clone) on DGX Spark
|
|
9
|
1544
|
April 14, 2026
|
|
HOW-TO: setup-dgx-spark docker inference - A "Sane" Inference Stack for GB10 (Need Contributors!)
|
|
34
|
1506
|
April 14, 2026
|
|
Two multi-node DGX Spark wins: RoCE 2× inference throughput + Qwen3.5-397B-A17B-NVFP4 serving (with SM121 CUTLASS patch)
|
|
2
|
216
|
April 14, 2026
|
|
Support for Qwen3-TTS on DGX Spark (GB10) | torchaudio installation failure on ARM64
|
|
6
|
741
|
April 14, 2026
|
|
[Guide] Uncensored Gemma-4-26B at 45 tok/s on DGX Spark — Actually Feels Great to Use!
|
|
0
|
236
|
April 13, 2026
|
|
My DGX Spark Hangs ... is this normal?
|
|
4
|
156
|
April 13, 2026
|
|
Qwen3.5-397B-A17B + DGX Spark (duo)
|
|
56
|
4424
|
April 13, 2026
|
|
Multilingual Speech-to-Text STT / ASR with Nvidia parakeet-tdt-0.6b-v3 for the DGX Spark
|
|
5
|
237
|
April 13, 2026
|
|
Marlin Fix: NVFP4 Actually Works on SM121 (DGX Spark)
|
|
15
|
1343
|
April 12, 2026
|
|
RAGFlow v0.24.0 on DGX Spark - working native ARM64 build with GPU-accelerated OCR
|
|
0
|
82
|
April 12, 2026
|
|
DGX Spark RAG on Docker
|
|
1
|
183
|
April 12, 2026
|
|
Wan2GP on the DGX Spark WAN 2.2 Docker Container
|
|
1
|
658
|
April 12, 2026
|
|
NemoClaw failed adding Telegram channel
|
|
6
|
162
|
April 11, 2026
|
|
RedHatAI/Qwen3.5-122B-A10B-NVFP4 seems to be the best option for a single Spark
|
|
74
|
4337
|
April 11, 2026
|
|
Guide: Gemma 4 31B on DGX Spark via NemoClaw — Dual-Model Setup Guide
|
|
3
|
844
|
April 10, 2026
|
|
Running GLM-4.7-FP8 (355B MoE) on 4x DGX Spark with SGLang + EAGLE Speculative Decoding
|
|
38
|
1506
|
April 10, 2026
|
|
ONNX Runtime GPU inference on DGX Spark (GX10) — build guide and prebuilt binaries
|
|
0
|
99
|
April 10, 2026
|
|
Qwen3.5-122B-A10B NVFP4 Quantized for DGX Spark — 234GB → 75GB, Runs on 128GB
|
|
44
|
8255
|
April 9, 2026
|
|
Spark and vllm
|
|
0
|
110
|
April 9, 2026
|
|
New pre-built sglang Docker Images for NVIDIA DGX Spark
|
|
22
|
1521
|
April 9, 2026
|
|
DGX Spark Model Manager — Open Source Web UI for Ollama, SGLang & LiteLLM
|
|
5
|
391
|
April 9, 2026
|
|
OpenClaw + Ollama hybrid + ClawMobile architecture
|
|
6
|
190
|
April 8, 2026
|
|
vLLM custom for DGX Spark - STREAM LOADING and automatic KV cache
|
|
10
|
384
|
April 8, 2026
|
|
NeuralForge GPU Native Knowledge Intelligence Platform Built on DGX Spark GB10
|
|
1
|
120
|
April 8, 2026
|
|
DGX Spark GB10 / vLLM 0.19.1: TurboQuant KV cache integration results on Qwen3.5 and Nemotron, including gather-free Triton decode and CUDA WPH decode
|
|
5
|
987
|
April 7, 2026
|
|
Sparkrun - central command with tab completion for launching inference on Spark Clusters
|
|
60
|
1677
|
April 6, 2026
|
|
Gemma4 Benchmarks on double DGX Sparks Ray Cluster and single DGX
|
|
2
|
492
|
April 6, 2026
|
|
Trinity-Large-Thinking should fit in 2 Sparks
|
|
3
|
331
|
April 6, 2026
|