We unlocked NVFP4 on the DGX Spark: 20% faster than AWQ!


| model                                    |   test |             t/s |     peak t/s |     ttfr (ms) |   est_ppt (ms) |   e2e_ttft (ms) |
|:-----------------------------------------|-------:|----------------:|-------------:|--------------:|---------------:|----------------:|
| nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4 | pp2048 | 4340.68 ± 36.00 |              | 473.50 ± 3.90 |  471.85 ± 3.90 |   473.59 ± 3.91 |
| nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4 |   tg32 |    41.20 ± 0.04 | 42.54 ± 0.05 |               |                |                 |

I just unplugged the power adapter and waited for about 5 minutes before restarting the DGX SPARK. Now the performance is back to normal, and I didn’t rebuild the image during this process.

I suspect it might be related to the NVIDIA DGX Spark Field Diagnostics I ran yesterday.

1 Like