This is the configuration that worked
DeepSeek-V4-Flash (official FP8) running across 2x DGX Spark — TP=2, MTP, 200K ctx, recipe + numbers
bjk110
194
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Deepseek v4 Flash on 2 Nodes | 71 | 6574 | June 15, 2026 | |
| DeepSeek v4 Flash (Aiden Recipe from Reddit) - 1M token session operational, Cuda 12.1 tailored for DGX Spark GB10 | 278 | 13481 | July 3, 2026 | |
| DeepSeek-V4-Flash on 4× DGX Spark via vLLM (jasl fork, TP=4, RDMA, MTP) — 49–54 tok/s single-stream, full recipe + the traps | 3 | 544 | June 19, 2026 | |
| Deepseek V4 released | 143 | 16886 | May 18, 2026 | |
| DeepSeek V4 Flash (1,048,576 Context) on 2x DGX Spark – Custom Sparkrun Recipe | 11 | 908 | June 14, 2026 | |
| Fully custom CUDA-native Deepseek 4 Flash optimized for 1x Spark! antirez/ds4 | 77 | 7959 | June 28, 2026 | |
| DeepSeekV4-Flash hybrid quant, 1x DGX Spark: antirez's optimized 128 GB MLX recipe ported to vLLM for GB10 | 18 | 2047 | May 11, 2026 | |
| DeepSeek V4 Flash: Bringing Frontier AI to the Home | 11 | 3660 | May 17, 2026 | |
| Anyone having luck with Deepseek V4 Flash on Dual Sparks? | 13 | 1425 | June 4, 2026 | |
| DeepSeek v4 Flash (IQ2XXS) on a single GB10! | 13 | 4159 | July 2, 2026 |