Hi All! I am running vLLM on a DGX Spark and need to use vLLM version 0.12.0 to get support for the new Ministral 3 model family (Ministral-3-14B-Instruct, etc.).
The official NGC container (nvcr.io/nvidia/vllm:25.11-py3) currently ships vLLM 0.11.x, which predates support for these models.
Given that the DGX Spark is a newer platform and often requires the NVIDIA-optimized container for correct memory and hardware handling (specifically for the sm_121a architecture), are there plans to release an updated official NGC container with vLLM >= 0.12.0 soon?
If an official image is not immediately available, could you provide the recommended steps, or a canonical guide, for building vLLM >= 0.12.0 from source specifically for the DGX Spark? I'm not sure how to do this without losing the platform-specific optimizations you have baked into the container.
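For reference, here is the rough recipe I was planning to try inside the existing 25.11 container, pieced together from the generic vLLM build-from-source instructions. The TORCH_CUDA_ARCH_LIST value for sm_121a and the choice to reuse the container's PyTorch are my assumptions, so corrections would be very welcome:

```shell
# Rough sketch only — assumes the generic vLLM source-build flow
# applies unchanged on DGX Spark.
git clone https://github.com/vllm-project/vllm.git
cd vllm
git checkout v0.12.0

# Reuse the PyTorch build already present in the NGC container
# instead of letting pip pull in a different wheel.
python use_existing_torch.py
pip install -r requirements/build.txt

# Target the DGX Spark GPU architecture; "12.1a" for sm_121a is
# my guess and may need adjusting.
export TORCH_CUDA_ARCH_LIST="12.1a"
pip install --no-build-isolation -e .
```

My main worry is whether a build like this keeps the NVIDIA-specific kernel and memory optimizations from the container, or whether those live outside the vLLM tree entirely.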
Thank you for your assistance!