Has anyone seen this? The benchmarks look impressive. GLM 5.1 performance for half the param count. 397B works well on 2 Sparks, so I think this should work too. The lab feels sketchy though. It seems to be some sort of AI focused private Chinese university. There’s only an MLX quant available for now.
They also have a Mini version that’s based on 35B A3B, with benchmarks matching the 27B. Not sure if I buy it.