DGX Spark by far the best inference (at the edge) option?

stefan132 · January 20, 2026, 2:02pm

There is a lot of disappointment on the DGX Spark when it comes to inference.

But I think, DGX Spark is by far the best inference (at the edge) option for large models (>>100B) at the moment, e.g. for SMEs.
Where am I going wrong with that statement?

In scenarios for on prem inference for SMEs with limited budgets, there are several options like

RTX Pro 6000 Blackwell with 96GB
RTX 5090 32GB
Mac Ultras
DGX Spark
Strix Halo

Clustered DGX Spark (being around 3100 USD / Asus version, and a cluster of two sparks 6200 USD) runs Minimax M2.1 decently, other LLMs of similar size as well.

This would require two RTX Pro 6000. Including the computer it is almost 3 times the price. Yes, faster. But I could get nearly 6 sparks, i.e. 3 cluster that, at least with vLLM, scale well enough at decent speed.

Mac Studios with Ultra chips are great as well, but my scenario requires at least 256 GB RAM and 80 core GPU at around 7000 USD. Much worse prompt processing speed and scaling options. But good tg speed.

Strix Halo is way too slow and no scaling.

RTX 5090 is out of game, would need 5 or 6, which is crazy energy consumption and requires huge chassis.

Where am I going wrong ?

aniculescu · January 20, 2026, 10:34pm

Thank you for your review. We are glad you are liking the Spark. Please let us know if you have any more feedback.

fidecastro · January 21, 2026, 3:21am

This seems spot on. It’s one of the reasons I am excited with the DGX Spark platform.

Topic		Replies	Views
Reviews are coming in DGX Spark / GB10	27	7423	November 24, 2025
A Spark to beat M5 Ultra and a MegaSpark to beat 2x Rubin PRO 6000! DGX Spark / GB10 nemotron	50	2597	June 25, 2026
DGX Spark vs AMD Strix Halo DGX Spark / GB10 llama	4	9027	February 18, 2026
I have ordered a second unit. Don't know why my friends say I'm stupid DGX Spark / GB10	47	3371	May 25, 2026
NVIDIA DGX SPARK DGX Spark / GB10	15	2058	December 9, 2025
DGX Spark + RTX 3090 (any other GPU) --> DGX Spark Mini Station (DGX Sprak + (e)dGPU) DGX Spark / GB10 cuda , gaming , llama	6	1389	April 14, 2026
MSI EdgeXpert vs DGX Spark DGX Spark / GB10 performance	0	1756	November 26, 2025
How are you planning on using your DGX spark? DGX Spark / GB10 Projects	22	3277	February 24, 2026
Best Inference Framework & Open Models for Orchestrator-Workers Agentic Coding on GB10 + 5090 Hybrid? DGX Spark / GB10 llama , agentic-ai , deepseek	1	685	February 19, 2026
Why 273 GB/s? Less Is More, Until It Isn’t DGX Spark / GB10	67	2782	March 27, 2026

DGX Spark by far the best inference (at the edge) option?

Related topics