Please fix the official nvidia/vllm docker container

vgoklani · January 24, 2026, 5:17am

Is it possible to update the official nvidia/vllm Docker container? The latest image appears to be pinned to vLLM v0.11, which is quite outdated. I opened an issue on the vLLM GitHub repo but didn’t get a response, so I’m posting here in hopes someone from NVIDIA sees this and can help get it fixed. Thanks!

cosinus · January 24, 2026, 9:18am

The NVIDIA containers for vLLM are updated monthly, sometimes with minor updates in between.

But you won’t see v0.14.0 anytime soon. NVIDIA seems more interested in stability than in the latest features. I assume each new vLLM release is thoroughly tested, since it has to run not only on a GB10 but across all relevant GPUs, especially their big iron.

You could go with the official vLLM images, but the best way to unlock the full potential of your Spark(s) is currently the solution provided by eugr, as he also tries to incorporate changes and patches before they hit the official vLLM build.

As you can see here:

If you build these containers with the --use-wheels switch, the build is quite fast, which saves time and nerves. It also lets you use the latest transformers if needed, as in this example for a bleeding-edge model like GLM 4.7 Flash, something that is not yet available with the current vLLM release.
