How to run GLM 4.7 on dual DGX Sparks with vLLM / mods support in spark-vllm-docker

Unfortunately, if you don’t use two Sparks, you’re basically wasting $1500, because the InfiniBand module you already paid for sits unused. Also, training scales linearly as you add nodes to the cluster.