Excited to release this!
tenari
167
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Introducing PrismaScout -- PrismaQuant v2! | 87 | 5264 | June 9, 2026 | |
| What's the best speed we can get with Qwen 3.6 27B without quantizing? | 30 | 14514 | June 7, 2026 | |
| Qwen3.6-27B is out! | 284 | 24301 | June 3, 2026 | |
| Qwen/Qwen3.5-122B-A10B - Alibaba/Qwen thought about us... :-D | 340 | 16521 | March 24, 2026 | |
| Qwen3.5-122B-A10B NVFP4 Quantized for DGX Spark — 234GB → 75GB, Runs on 128GB | 44 | 10955 | April 9, 2026 | |
| RedHatAI/Qwen3.5-122B-A10B-NVFP4 seems to be the best option for a single Spark | 75 | 6109 | May 4, 2026 | |
| PSA: State of FP4/NVFP4 Support for DGX Spark in VLLM | 234 | 12632 | May 15, 2026 | |
| Qwen/Qwen3.6-35B-A3B (and FP8) has landed | 308 | 25826 | June 9, 2026 | |
| Qwen3.5 27B optimisation thread starting at 30+ t/s TP=1 | 23 | 2695 | May 11, 2026 | |
| Qwen3.5-122B-A10B on single Spark: up to 51 tok/s (v2.1 — patches + quick-start + benchmark) | 417 | 19673 | June 9, 2026 |