you’re in the same ballpark, the only difference might be the VLLM version and the vLLM tune:
Introducing vLLM-Tune — Kernel tuning CLI for vLLM on DGX Spark
And also with:./build-and-copy.sh -t vllm-node --apply-vllm-pr 40898
Those two things improved my speed ever so slightly. My recipe is the exact same I pasted it here