I suspect it was related to running the NVIDIA DGX SPARK diagnostic program yesterday. I just unplugged the power adapter from the power strip, waited for a few minutes, and then restarted the system for testing. It has basically returned to normal. (We unlocked NVFP4 on the DGX Spark: 20% faster than AWQ! - #91 by cho)
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| PSA: State of FP4/NVFP4 Support for DGX Spark in VLLM | 231 | 10094 | April 21, 2026 | |
| We unlocked NVFP4 on the DGX Spark: 20% faster than AWQ! | 145 | 6896 | March 28, 2026 | |
| What's the best speed we can get with Qwen 3.6 27B without quantizing? | 26 | 4554 | April 24, 2026 | |
| RedHatAI/Qwen3.5-122B-A10B-NVFP4 seems to be the best option for a single Spark | 74 | 4860 | April 11, 2026 | |
| Best Q4 / NVFP4 model for quality Qwen3.5-27B or alternatives? | 16 | 1393 | April 26, 2026 | |
| Qwen3.5-122B-A10B NVFP4 Quantized for DGX Spark — 234GB → 75GB, Runs on 128GB | 44 | 9330 | April 9, 2026 | |
| From 20 to 35 TPS on Qwen3-Next-NVFP4 w/ FlashInfer 12.1f | 10 | 1571 | January 7, 2026 | |
| New bleeding-edge vLLM Docker Image: avarok/vllm-nvfp4-gb10-sm120 | 35 | 2881 | December 31, 2025 | |
| Two-Spark cluster with vLLM using tensor-parallel-size 2 causes one node to drop while the other's GPU goes 100% forever | 36 | 1292 | February 13, 2026 | |
| Qwen/Qwen3.5-122B-A10B - Alibaba/Qwen thought about us... :-D | 340 | 15159 | March 24, 2026 |