Unable to use newest sglang on Jetson Orin 64GB

sihouzi21c · October 21, 2025, 8:56am

SGLang has been updated to 0.5.4. I’ve tried three ways to upgrade, but each one fails with a different error:

1. I searched for existing Docker images, and the newest one I found was 0.4.7, which is out of date: https://hub.docker.com/r/dustynv/sglang/tags

2. I tried jetson-containers build sglang:0.5.3, but the build fails when it reaches flashinfer, because the current CUDA version is 12.6 while flashinfer expects 12.8.

3. I tried pip install sglang==0.5.3 in an existing sglang Docker container, but it fails with the error "no kernel image is available for execution on the device", as Orin is SM87.

Are there any suggestions for dealing with these problems? I’d greatly appreciate your time and help. Thank you in advance for your support!
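For the third attempt above, a quick check is to compare the GPU's compute capability with the architectures the installed PyTorch build was compiled for. The following is a minimal sketch, not a definitive diagnosis: the helper names sm_tag and has_native_kernels are hypothetical, and it only inspects the PyTorch build through torch.cuda.get_device_capability() and torch.cuda.get_arch_list(). Other compiled packages such as sgl-kernel or flashinfer carry their own architecture lists, which this does not examine.

```python
def sm_tag(capability):
    """Format a (major, minor) compute capability as an arch tag, e.g. (8, 7) -> 'sm_87'."""
    major, minor = capability
    return f"sm_{major}{minor}"


def has_native_kernels(arch_list, capability):
    """Return True if arch_list contains an exact sm_XY entry for this capability.

    This is a deliberately conservative check (assumption): it ignores PTX
    entries such as 'compute_80' that might be JIT-compiled at runtime.
    """
    return sm_tag(capability) in arch_list


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        torch = None

    if torch is not None and torch.cuda.is_available():
        capability = torch.cuda.get_device_capability()  # e.g. (8, 7) on Orin
        arch_list = torch.cuda.get_arch_list()           # e.g. ['sm_80', 'sm_87']
        print("device:", sm_tag(capability))
        print("PyTorch built for:", arch_list)
        print("exact match:", has_native_kernels(arch_list, capability))
    else:
        print("PyTorch with CUDA support is not available in this environment")
```

If the exact match is missing, a "no kernel image" error from that package would be expected. If it is present, the mismatch more likely sits in another compiled wheel.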

sihouzi21c · October 21, 2025, 9:07am

Additionally, I searched for existing whl packages at https://pypi.jetson-ai-lab.io/jp6/cu129/+simple/sgl-kernel/ and under the other CUDA versions, but the available packages do not cover my setup, as my Python version is 3.12.

carolyuu · October 21, 2025, 9:15am

*** Please note that this reply is generated by LLM automatically ***

I’d be happy to help you with your issue!

It appears that you’re trying to update SGLang on Jetson AGX Orin to version 0.5.4, but you’re encountering a different error with each approach. Let’s break down each of your attempts and provide suggestions:

Attempt 1: Searching for existing Docker images

You’ve searched for the latest Docker image on Docker Hub, but the newest image available is version 0.4.7, which is outdated. This approach won’t work since the image is not up to date.

Attempt 2: Building from source

You’ve tried to build SGLang 0.5.3 with jetson-containers, but the build fails at flashinfer because of a CUDA version mismatch. The current CUDA version is 12.6, while the flashinfer build expects CUDA 12.8. This approach won’t work until the versions match.

Attempt 3: Updating in existing container

You’ve tried to update SGLang with pip in an existing container, but you’re encountering the error "no kernel image is available for execution on the device". This indicates that the installed binaries do not include kernels compiled for Orin’s SM87 architecture, so this approach won’t work with those packages.

Suggestions

1. Wait for an updated Docker image: You can keep an eye on the Docker Hub repository for an updated image that matches your required version (0.5.4). This might take some time, but it’s the easiest way to get the latest version.
2. Build from source with a compatible CUDA version: You can try building SGLang from source, but you’ll need to ensure that you have the correct CUDA version (12.8) installed on your system. This might require updating your CUDA installation or using a different build environment.
3. Use a different update method: If updating in an existing container is not possible, you can try using a different update method, such as using apt-get or pip to update SGLang. However, this might require additional dependencies or configuration.
4. Contact the SGLang maintainers: If none of the above approaches work, you can try reaching out to the SGLang maintainers or the NVIDIA developer community for assistance. They might be able to provide guidance or a temporary workaround.

In summary, the best approach would be to wait for an updated Docker image or build from source with the correct CUDA version. If these options are not feasible, you can try alternative update methods or seek assistance from the Sglang maintainers or the NVIDIA developer community.


AastaLLL · October 22, 2025, 4:37am

Hi,

Have you tried the latest one?

dustynv/sglang:r36.4-cu129-24.04

Thanks.

sihouzi21c · October 22, 2025, 6:50am

Thanks for your reply. I’ve tried dustynv/sglang:r36.4-cu128-24.04, which was published at about the same time as the image you mentioned. I’ll try dustynv/sglang:r36.4-cu129-24.04 and report the result.

sihouzi21c · October 22, 2025, 11:22am

Hi, I’ve pulled dustynv/sglang:r36.4-cu129-24.04 and the sglang version inside is 0.4.9, which is behind the version I need, 0.5.3. Are there any further suggestions?
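One way to confirm which sglang version an image ships is to query the package metadata from inside the container and compare it with the version you need. This is a minimal sketch using only the standard library. The helper names parse_version, is_older and installed_version are hypothetical, and the comparison only looks at the leading numeric components, so suffixes such as post or rc tags are ignored.

```python
from importlib.metadata import PackageNotFoundError, version


def parse_version(text):
    """Turn '0.5.3' into (0, 5, 3), keeping only the leading numeric components."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if ch.isdigit():
                digits += ch
            else:
                break
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def is_older(installed, wanted):
    """Return True if the installed version is strictly older than the wanted one."""
    return parse_version(installed) < parse_version(wanted)


def installed_version(package="sglang"):
    """Return the installed version string of a package, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("sglang")
    if found is None:
        print("sglang is not installed in this environment")
    else:
        print("sglang", found, "older than 0.5.3:", is_older(found, "0.5.3"))
```

Running this inside each candidate image saves a full model launch just to learn that the bundled version is too old.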

sihouzi21c · October 24, 2025, 4:06am

Following the suggestion from D-G-Dimitrov in https://github.com/dusty-nv/jetson-containers/pull/1470, I pulled the corresponding Docker image, which solved the problem. Thank you, AastaLLL and D-G-Dimitrov!

system · November 18, 2025, 9:04am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
