Shoutout to Eugr

christopher_owen · February 23, 2026, 5:53pm

josephbreda · February 23, 2026, 6:13pm

Saw that. I strongly believe NVIDIA could build a great amount of goodwill with this community if they leveled @eugr up with two more sparks and the necessary switchgear and cables.

The support he has provided has dramatically increased the value of their offering. @johnny_nv any pull here?

raphael.amorim · February 23, 2026, 6:25pm

Totally support this. Where do I sign?

vedcsolution · February 23, 2026, 7:52pm

Without the support of this community, thanks a @eugr ., it wouldn’t be possible to test the latest models; everything would be much slower. I think this is a puzzle machine.

notmy.reward438 · February 23, 2026, 7:57pm

Yeah saw that also :D eugr became famous!

There are definitely at least a community members that deserve some love like @raphael.amorim also. Nvidia should hire them…

christopher_owen · February 23, 2026, 8:31pm

He was already famous in my eyes!

tyrelcb · February 23, 2026, 8:39pm

Sometimes if you work for a company your hands and tongue are tied… so to speak. Non-biased info is always good. Even though all info usually has some form of bias. They should send him 8 of these, enable infiniband on the connectx7 interface, and provide a quality switch. Maybe about a ~ $64k setup.

eugr · February 23, 2026, 10:48pm

Thanks guys!

At this point, it’s not just “my” project anymore, there is a whole community around it - some contribute to the repository directly (e.g. @raphael.amorim) , some indirectly (like @christopher_owen), and others by submitting new issues, flagging new models/workarounds in the forums, etc…

tbraun96 · February 23, 2026, 10:52pm

@eugr is the celeb on this forum. He’s literally flying sky-high on a helicopter. I second Nvidia sending bulk dgx’s to him, but, rabbit hole investigations show Nvidia has artificially bottlenecked the DGX Spark to separate the $4000 tier from the $10000 tier RTX 6000 users. I am reserved as to whether or not they would do so, although, I have seen them send DGX Sparks to podcasters who just learned what the echo command does,

agustinr · February 23, 2026, 10:59pm

I saw the video! I also support some sparks to eugr :)

Alex’s video sent me over the edge. I went and purchased a second GX10 last night. And now thanks to Eugr’s repo, I have MM M2.5 running. Amazing.

eugr · February 23, 2026, 11:03pm

I tried, and decided to stick to the airplanes :) My instructor had a good laugh though, lol.

eugr · February 23, 2026, 11:05pm

Funny thing, but the reason I purchased the second Spark originally was because no one would test inference on a cluster :) That was back in November. Turned out that it not only works fairly well, but makes Spark much more useful.

raphael.amorim · February 23, 2026, 11:08pm

I remember those times. What a long thread. LOL
When I got my second Spark most of the fun times were gone.

flash3 · February 24, 2026, 11:43am

cool.

have you noticed the white rabbit?

AoE · February 24, 2026, 12:41pm

2nd GB10 box arrived this morning. Would probably not have done it so soon if @eugr didn’t give us that spark-vllm-docker repo to test strategies with :)

raphael.amorim · February 24, 2026, 1:23pm

The rabbit is waiting for the right quantization 😂

christopher_owen · February 24, 2026, 1:27pm

Follow the white rabbit.

parad8x010 · February 25, 2026, 10:21am

Really nice work — I genuinely enjoy your channel and it’s always a pleasure to watch your updates. That said, I’m not fully sure why you chose this specific quantization and stack.

I’m also trying to understand the practical point of the benchmark as it stands, because I’m getting ~81 TPS on llama.cpp on my side, which is clearly higher than what your test shows even on 4 Sparks. So it feels like we need to reframe the experiment: either rerun it on EXO, or try alternative tools + different quants to see what the real takeaway is and whether the Spark setup actually delivers a meaningful advantage in real-world inference.

i have (17GB, MXFP4, 1M) [256K, 🟢81.3 tok/s ⭐9.7(#2) 🏆] - Qwen3-VL 30B - on 1 spark.

christopher_owen · February 25, 2026, 11:11am

I think he used full precision weights but you are mentioning mxfp4. This could definitely lead to the TPS difference.

eugr · February 25, 2026, 4:50pm

He used different models in the test, not sure which one you are talking about. I don’t think he used Qwen3-VL-30B, he used Qwen3-VL-32B (in BF16), which is a dense model and has 32B active parameters (vs. 3B active parameters for Qwen3-VL-30B). The 32B model is very slow even on dual sparks (unless you go to 4 bit quants, then it performs at around 21 t/s on two sparks (12 t/s on one).

Topic		Replies	Views
Eugr joins NVIDIA Spark Team! DGX Spark / GB10 llama	108	4023	June 24, 2026
Should we as a community gofundme one Spark for Eugr's nightly builds? DGX Spark / GB10	51	1717	April 1, 2026
NVFP4 on DGX Spark / GB10 is broken. I bought 9 of these for this feature. Requesting NVIDIA's official roadmap and response DGX Spark / GB10 jetson , llama , agentic-ai , nemotron , nemoclaw	44	5945	May 17, 2026
What a missed opportunity for nvidia DGX Spark / GB10	22	1262	March 30, 2026
A Spark to beat M5 Ultra and a MegaSpark to beat 2x Rubin PRO 6000! DGX Spark / GB10 nemotron	45	1910	June 24, 2026
New DGX Spark purchased. But I can't add it to my Nvidia profile DGX Spark / GB10	1	233	December 18, 2025
DGX Spark (SM121) Software Support is Severely Lacking - Official Roadmap Needed DGX Spark / GB10	41	5393	February 15, 2026
Dearest CUTLASS TEAM, When the hell are you going to properly fix tcgen05 FP4 support for DGX Spark / GB10 (SM121)? DGX Spark / GB10	37	2422	April 25, 2026
DGX Spark by far the best inference (at the edge) option? DGX Spark / GB10 edgeai	2	1075	January 21, 2026
I have ordered a second unit. Don't know why my friends say I'm stupid DGX Spark / GB10	47	3256	May 25, 2026

Shoutout to Eugr

Related topics