Nvidia Cosmos running on Jetson

johnnynunez · January 11, 2025, 9:29am

🚀 𝗧𝗵𝗿𝗶𝗹𝗹𝗲𝗱 𝘁𝗼 𝗮𝗻𝗻𝗼𝘂𝗻𝗰𝗲 𝗮 𝗺𝗮𝗷𝗼𝗿 𝗺𝗶𝗹𝗲𝘀𝘁𝗼𝗻𝗲 𝗶𝗻 𝗺𝘆 𝗷𝗼𝘂𝗿𝗻𝗲𝘆 𝘄𝗶𝘁𝗵 𝗡𝗩𝗜𝗗𝗜𝗔 𝗖𝗼𝘀𝗺𝗼𝘀™! 🌌⁣

⁣

I successfully ported the revolutionary NVIDIA 𝗖𝗼𝘀𝗺𝗼𝘀™ 𝗽𝗹𝗮𝘁𝗳𝗼𝗿𝗺 to the 𝗝𝗲𝘁𝘀𝗼𝗻 𝗔𝗚𝗫 𝗢𝗿𝗶𝗻, along with the Transformer Engine, making both fully containerized with Docker for a true plug-and-play experience. 𝗖𝗼𝘀𝗺𝗼𝘀 is a groundbreaking platform of generative world foundation models (W𝗙𝗠𝘀), advanced tokenizers, and an accelerated data processing pipeline, purpose-built to advance Physical AI in autonomous vehicles and robotics.⁣

⁣

This work was recently showcased at CES 2025, where Cosmos took center stage as a transformative technology for developers and industries worldwide. With the port to Jetson AGX Orin, we’re unlocking the power of Cosmos and the 𝗧𝗿𝗮𝗻𝘀𝗳𝗼𝗿𝗺𝗲𝗿 𝗘𝗻𝗴𝗶𝗻𝗲 for edge applications, allowing developers to leverage its physics-based synthetic data generation, model fine-tuning capabilities, and highly efficient inference on compact, efficient systems.⁣

⁣

𝗣𝗼𝗿𝘁𝗶𝗻𝗴 𝗖𝗼𝘀𝗺𝗼𝘀 𝗮𝗻𝗱 𝘁𝗵𝗲 𝗧𝗿𝗮𝗻𝘀𝗳𝗼𝗿𝗺𝗲𝗿 𝗘𝗻𝗴𝗶𝗻𝗲 wasn’t just about integration—it’s about empowering developers to harness the future of AI-driven robotics and autonomous systems. With its modular, scalable design, Cosmos is a key enabler for innovation, helping the industry address challenges like data scarcity and variability through synthetic environments that are both photoreal and physics-based.⁣

⁣

Let’s shape the future of Physical AI together! Feel free to connect, collaborate, and share your insights on this exciting journey. 🚀⁣

⁣

#NVIDIA cosmos #NvidiaCosmos #TransformerEngine #PhysicalAI #GenerativeAI #JetsonAGXOrin robotics edgecomputing #AutonomousVehicles ai #Innovation #CES2025 docker #SyntheticData⁣

kalustian · January 13, 2025, 2:04pm

Hello Johnny

Can you please share step by step how you install it on the AGX orin ( I have the 32Gb version on Jetpack 6.1 ) ?

NOTE: When following and running from below page., the process got “killed”

github.com

NVIDIA/Cosmos/blob/main/INSTALL.md

# Cosmos Installation

We have only tested the installation with Ubuntu 24.04, 22.04, and 20.04.

1. Install the [NVIDIA Container Toolkit](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/install-guide.html).

2. Clone the repository.

```bash
git clone git@github.com:NVIDIA/Cosmos.git
cd Cosmos
```

3. Build a Docker image using `Dockerfile` and run the Docker container.

```bash
docker build -t cosmos .
docker run -d --name cosmos_container --gpus all --ipc=host -it -v $(pwd):/workspace cosmos
docker attach cosmos_container
```

Thanks

johnnynunez · January 13, 2025, 3:03pm

Using jetson containers and my docker:

https://hub.docker.com/r/johnnync/r36.4.0-cu126-cp310-cosmos

kalustian · January 13, 2025, 3:21pm

I am downloading the Cosmos from the Docker page now for testing. But I can not find it on the Dusty github portal. How I should pull it from the Dusty container page ?

Thanks

johnnynunez · January 13, 2025, 3:21pm

because, he still is working in other promising things @dusty_nv

kalustian · January 13, 2025, 3:27pm

ok , no problem. I finished downloading it from the Docker page. I am running it as sudo docker run johnnync/r36.4.0-cu126-cp310-cosmos, but it does not start the Cosmos container. During installation no errors found. Any suggestion to how properly run it ?

johnnynunez · January 13, 2025, 3:30pm

jetson-containers run -it -v $(pwd):/workspace johnnync/r36.4.0-cu126-cp310-cosmos (my docker)

in pwd it is pointing to Cosmos clone, because if you download model inside docker, if it break, you lost the models… and the process to download them is very large.

also @shahizat replicates my process. Can you help him?

shahizat · January 13, 2025, 3:34pm

Hello @kalustian

You can use this command also. I confirm that @johnnynunez’s container image works.

docker run --runtime nvidia -it --rm -v ./cosmos:/models --network=host johnnync/r36.4.0-cu126-cp310-cosmos:latest

kalustian · January 13, 2025, 3:35pm

checking…

kalustian · January 13, 2025, 3:41pm

Hi shahizat

I run Johnny’s and your command and I could be able to get into the container. Once in there I typed the following commands:

PROMPT=“A sleek, humanoid robot stands in a vast warehouse filled with neatly stacked cardboard boxes on industrial shelves.
The robot’s metallic body gleams under the bright, even lighting, highlighting its futuristic design and intricate joints.
A glowing blue light emanates from its chest, adding a touch of advanced technology. The background is dominated by rows of boxes,
suggesting a highly organized storage system. The floor is lined with wooden pallets, enhancing the industrial setting.
The camera remains static, capturing the robot’s poised stance amidst the orderly environment, with a shallow depth of
field that keeps the focus on the robot while subtly blurring the background for a cinematic effect.”

PYTHONPATH=$(pwd) python cosmos1/models/diffusion/inference/text2world.py
–checkpoint_dir checkpoints
–diffusion_transformer_dir Cosmos-1.0-Diffusion-7B-Text2World
–prompt “$PROMPT”
–offload_prompt_upsampler
–video_save_name Cosmos-1.0-Diffusion-7B-Text2World

and I got this :

“Python not found”

shahizat · January 13, 2025, 3:48pm

@kalustian please use this commands below, mount docker using -v option to download the models there:

Firstly, download the model:

PYTHONPATH=/opt/Cosmos python3 /opt/Cosmos/cosmos1/scripts/download_diffusion.py --model_sizes 7B 14B --model_types Text2World Video2World

Then run

PROMPT="The video is a dynamic and immersive driving experience captured from the perspective of a car's dashboard camera, likely mounted on the windshield. The setting is a narrow, two-lane road surrounded by lush greenery, suggesting a scenic route through a forested area. The road is marked with a single yellow line in the center, indicating a one-way traffic direction. The camera remains mostly static, providing a consistent view of the road ahead, while the car moves swiftly around a sharp curve to the right. The surroundings are dense with tall trees, and the road is flanked by a guardrail on the left side, which adds to the sense of speed and adventure. The weather appears overcast, with a misty atmosphere that enhances the feeling of being enveloped in nature. The car's speed is evident from the blurred background and the consistent motion of the road's edge. The video captures the thrill of driving through a picturesque landscape, emphasizing the connection between the driver and the natural environment. The camera's perspective remains focused on the road, with no visible pedestrians or other vehicles, creating an uninterrupted driving experience."

and finally run:

PYTHONPATH=$(pwd) python3 cosmos1/models/diffusion/inference/text2world.py  \
    --checkpoint_dir /models/checkpoints \
    --diffusion_transformer_dir Cosmos-1.0-Diffusion-7B-Text2World \
    --prompt "$PROMPT" \
    --video_save_name /models/New_Cosmos-1.0-Diffusion-7B-Text2World_memory_efficient \
    --offload_prompt_upsampler \
    --offload_tokenizer \
    --offload_diffusion_transformer

You can also join our discord channel: Jetson AI Lab Research Group Community

kalustian · January 13, 2025, 3:49pm

Please allow me 10-15 min to test drive …will provide feedback soon

johnnynunez · January 13, 2025, 4:34pm

change python to python3.
Also use all offload models.
With jetson thor will can execute every model on memory

kalustian · January 13, 2025, 11:00pm

Environment:

JetPack 6.1
AGX orin 32Gb RAM
SWP increased from 15Gb to 55Gb
CPU/GPU clocked setup at max speed: 2.2Ghz / 1.3Ghz

Here are the steps I have taken (thanks to Johnny and Shahizat)

1)Run the Docker:
$sudo docker run --runtime nvidia -it --rm -v ./cosmos:/models --network=host johnnync/r36.4.0-cu126-cp310-cosmos:latest

2) Install and login in HuuginFace:
pip install -U “huggingface_hub[cli]”

3) Download the model:
PYTHONPATH=/opt/Cosmos python3 /opt/Cosmos/cosmos1/scripts/download_diffusion.py --model_sizes 7B --model_types Text2World Video2World

4) Add a Prompt

PROMPT=“A sleek, humanoid robot stands in a vast warehouse filled with neatly stacked cardboard boxes on industrial shelves.
The robot’s metallic body gleams under the bright, even lighting, highlighting its futuristic design and intricate joints.
A glowing blue light emanates from its chest, adding a touch of advanced technology. The background is dominated by rows of boxes,
suggesting a highly organized storage system. The floor is lined with wooden pallets, enhancing the industrial setting.
The camera remains static, capturing the robot’s poised stance amidst the orderly environment, with a shallow depth of
field that keeps the focus on the robot while subtly blurring the background for a cinematic effect.”

or

PROMPT=“The video is a dynamic and immersive driving experience captured from the perspective of a car’s dashboard camera, likely mounted on the windshield. The setting is a narrow, two-lane road surrounded by lush greenery, suggesting a scenic route through a forested area. The road is marked with a single yellow line in the center, indicating a one-way traffic direction. The camera remains mostly static, providing a consistent view of the road ahead, while the car moves swiftly around a sharp curve to the right. The surroundings are dense with tall trees, and the road is flanked by a guardrail on the left side, which adds to the sense of speed and adventure. The weather appears overcast, with a misty atmosphere that enhances the feeling of being enveloped in nature. The car’s speed is evident from the blurred background and the consistent motion of the road’s edge. The video captures the thrill of driving through a picturesque landscape, emphasizing the connection between the driver and the natural environment. The camera’s perspective remains focused on the road, with no visible pedestrians or other vehicles, creating an uninterrupted driving experience.”

5) Run it:
PYTHONPATH=$(pwd) python3 cosmos1/models/diffusion/inference/text2world.py \
–checkpoint_dir checkpoints \
–diffusion_transformer_dir Cosmos-1.0-Diffusion-7B-Text2World
–prompt “$PROMPT” \
–video_save_name Cosmos-1.0-Diffusion-7B-Text2World_memory_efficient
–offload_tokenizer \
–offload_diffusion_transformer \
–offload_text_encoder_model \
–offload_prompt_upsampler \
–offload_guardrail_models

5) Success !!
After almost 3h @60 watts a 5 sec. video have been created. Special thanks to Johnny and Shahizat

Topic		Replies	Views
Issues running cosmos-reason1 on Jetson AGX orin Jetson AGX Orin containers , cosmos	10	252	November 10, 2025
NVIDIA Cosmos on AGX Orin issue Jetson AGX Orin cosmos	10	321	April 1, 2025
The Cosmos Tutorials on Jetson-AI-lab.com failes on Jetson AGX Orin 64 Jetson AGX Xavier jetson , cosmos	3	201	January 22, 2025
Maximize Robotics Performance by Post-Training NVIDIA Cosmos Reason Technical Blog cosmos	3	121	September 22, 2025
Advice on getting started with the Jetson Orin Nano Jetson Orin Nano jetson	4	293	September 25, 2025
Stable Diffusion on Jetson AGX Orin and Xavier Jetson Projects	10	8716	March 4, 2024
Cosmos-Reason2-2B running on Jetson Orin Nano Jetson Orin Nano llm , cosmos	2	90	February 24, 2026
LLaMa 2 LLMs w/ NVIDIA Jetson and textgeneration-web-ui Jetson Projects generative_ai	86	26168	May 10, 2024
Implementing Robotics Applications with ROS 2 and AI on the NVIDIA Jetson Platform Technical Blog	5	1358	July 15, 2021
NVIDIA® JETSON AGX ORIN™ 64G I need qwen chaGLM docker Jetson AGX Orin cuda , docker	2	329	May 13, 2024

Nvidia Cosmos running on Jetson

Related topics