| model | test | t/s | peak t/s | ttfr (ms) | est_ppt (ms) | e2e_ttft (ms) |
|:-----------------------------------------|-------:|----------------:|-------------:|--------------:|---------------:|----------------:|
| nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4 | pp2048 | 4340.68 ± 36.00 | | 473.50 ± 3.90 | 471.85 ± 3.90 | 473.59 ± 3.91 |
| nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4 | tg32 | 41.20 ± 0.04 | 42.54 ± 0.05 | | | |

I unplugged the power adapter, waited about 5 minutes, and then restarted the DGX Spark. Performance is now back to normal, and I did not rebuild the image at any point during this process.
I suspect the earlier slowdown was related to the NVIDIA DGX Spark Field Diagnostics I ran yesterday.
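
For anyone comparing their own runs against the table, here is a small sanity check of how the columns relate. It is only a sketch: it assumes `est_ppt` is roughly the prompt length divided by the prompt-processing throughput, which the table is consistent with but which the benchmark tool's documentation would need to confirm. The variable names are illustrative.

```python
# Figures copied from the pp2048 and tg32 rows of the table above.
prompt_tokens = 2048
prompt_tokens_per_second = 4340.68
reported_est_ppt_ms = 471.85
decode_tokens_per_second = 41.20

# Assumption: est_ppt is approximately prompt length / throughput.
derived_ppt_ms = prompt_tokens / prompt_tokens_per_second * 1000.0

# Per-token decode latency implied by the tg32 throughput.
ms_per_generated_token = 1000.0 / decode_tokens_per_second

print(f"derived prompt processing time: {derived_ppt_ms:.2f} ms")
print(f"reported est_ppt:               {reported_est_ppt_ms:.2f} ms")
print(f"decode latency per token:       {ms_per_generated_token:.2f} ms")
```

The derived value lands within the reported ± 3.90 ms spread, so a run whose `t/s` and `est_ppt` disagree by much more than that is worth a second look.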