Anyone tried mistralai/Leanstral-2603 · Hugging Face ?
It seems to be one model of the new Mistral 4 family of models.
Seems to be a perfect fit for DGX spark. Hope vLLM will run it soon. @eugr
It just landed. 😂
Mistral's first MoE in a long time. They also mention a new family of models, so maybe they will come in different sizes.
Wondering which quantization tool will support the new architecture. There is no config.json to check.
Unquantized it needs two Sparks and a custom vLLM version for now.
I will try them this week on the spark :)
I know. Life is short ;-)
I’ll try it!
They just announced a Nemotron coalition while presenting Nemo Claw. Mistral AI is also part of it, and so is Black Forest Labs. Looking forward to that.
There we go again!! Let's give it a try!!!
And another MoE: 119B parameters, with 6.5B activated per token.
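A quick back-of-the-envelope check on those numbers (a sketch, assuming bf16 weights at 2 bytes per parameter; KV cache and activations not included):

```python
# Rough memory math for a 119B-parameter MoE with 6.5B active per token.
# Assumes bf16 storage (2 bytes per parameter).
total_params = 119e9
active_params = 6.5e9

weight_gb = total_params * 2 / 1e9          # decimal GB of weights
active_fraction = active_params / total_params

print(f"Weights: {weight_gb:.0f} GB in bf16")         # 238 GB
print(f"Active per token: {active_fraction:.1%}")     # ~5.5%
# 238 GB of weights alone is more than one 128 GB Spark holds,
# which matches needing two Sparks unquantized.
```

So only about 5.5% of the parameters fire per token, which is why it can be fast despite the size.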
Nice.
Well.
cosinus@vroomfondel:~$ docker pull mistralllm/vllm-ms4:latest
latest: Pulling from mistralllm/vllm-ms4
no matching manifest for linux/arm64/v8 in the manifest list entries
Build your own… :-D
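Since no linux/arm64 manifest is published for that image, building locally is one option. A sketch, assuming the upstream vLLM repo ships a Dockerfile at the repo root (the repo URL and image tag here are my own placeholders, not the official ones):

```shell
# Sketch: build an arm64 vLLM image locally, since "docker pull" above
# found no linux/arm64/v8 manifest.
git clone https://github.com/vllm-project/vllm.git
cd vllm
# buildx lets you target the Spark's native platform explicitly
docker buildx build --platform linux/arm64 -t vllm-ms4:arm64 .
```

Whether the new architecture is actually supported in that source tree is a separate question, of course.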
@eugr we will need to be able to run it with vLLM :)
Tried it on the Nvidia NIM API. It seems a little better than Qwen 3.5 122B on a PowerShell-based test, and noticeably faster too, but not as good as OSS 120B (at least at PowerShell).
Yeah… personally, I found the Saudi and Korean models to be better, though it's only been a day.