Qwen3.5-122B-A10B on single Spark: up to 51 tok/s (v2.1 — patches + quick-start + benchmark)

@bernardlbmi3 @stefan132 @paxren2020

I have started a separate thread to discuss workarounds and optimizations for Qwen3.5-35B-A3B.

Let’s continue the discussion there: Qwen3.5-35B-A3B optimizations on single Spark