NVIDIA Developer Forums

DGX Spark PyTorch LLM training throughput up to 8x slower than expected

Accelerated Computing DGX Spark / GB10 User Forum DGX Spark / GB10

aniculescu April 2, 2026, 6:56pm 5

Please check out our benchmarking guide for benchmarking different models with different backends: DGX Spark Performance FAQ

Topic		Replies	Views	Activity
DGX Spark + Qwen3-Next-80B: Proven Performance, But Missing Clear Path to NIM, TensorRT-LLM & Web UIs DGX Spark / GB10 cuda , nim , llama	16	4053	March 6, 2026
DGX Spark performance DGX Spark / GB10	50	4442	February 27, 2026
Dgx spark benchmark performance DGX Spark / GB10	17	1993	January 4, 2026
TensorRT-LLM + nvidia/Llama-3.3-70B-Instruct-NVFP4 = 5 tok/s DGX Spark / GB10 llama	4	602	February 1, 2026
NVIDIA folks -- where is this promised nvfp4 speedup? DGX Spark / GB10	27	2576	March 26, 2026
Question on Inference Performance Results of Qwen3 235B A22B on 2× DGX Spark DGX Spark / GB10 cuda	5	709	December 19, 2025
We unlocked NVFP4 on the DGX Spark: 20% faster than AWQ! DGX Spark / GB10	145	7012	March 28, 2026
6x Spark setup DGX Spark / GB10	112	8606	April 25, 2026
Qwen3.5-122B-A10B NVFP4 Quantized for DGX Spark — 234GB → 75GB, Runs on 128GB DGX Spark / GB10 Projects	44	9465	April 9, 2026
DGX Spark: The Sovereign AI Stack — Dual-Model Architecture for Local Inference DGX Spark / GB10 Projects docker , spark , llm	9	1662	February 13, 2026