Qwen3.5-122B-A10B on single Spark: up to 51 tok/s (v2.1 — patches + quick-start + benchmark)

As the patches are python only, we can probably integrate this as mod @eugr ? :)

Nice thank you for the effort! Will give this a try, as this model is usually my daily driver.

3 Likes