How to run the Cosmos 1.0 7B Text2World model on six RTX 6000 Ada GPUs (48 GB each)

Hi,

We're trying to run the Cosmos 1.0 7B Text2World model on six RTX 6000 Ada GPUs (48 GB each), but the run fails with:
"
torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 128.00 MiB. GPU 0 has a total capacity of 47.51 GiB of which 8.81 MiB is free. Process 8224 has 47.48 GiB memory in use. Of the allocated memory 46.99 GiB is allocated by PyTorch, and 12.60 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True to avoid fragmentation.
"

We used the following example command for this run:

PROMPT="A sleek, humanoid robot stands in a vast warehouse filled with neatly stacked cardboard boxes on industrial shelves.
The robot's metallic body gleams under the bright, even lighting, highlighting its futuristic design and intricate joints.
A glowing blue light emanates from its chest, adding a touch of advanced technology. The background is dominated by rows of boxes,
suggesting a highly organized storage system. The floor is lined with wooden pallets, enhancing the industrial setting.
The camera remains static, capturing the robot's poised stance amidst the orderly environment, with a shallow depth of
field that keeps the focus on the robot while subtly blurring the background for a cinematic effect."

Example using 7B model

PYTHONPATH=$(pwd) python cosmos1/models/diffusion/inference/text2world.py \
    --checkpoint_dir checkpoints \
    --diffusion_transformer_dir Cosmos-1.0-Diffusion-7B-Text2World \
    --prompt "$PROMPT" \
    --offload_prompt_upsampler \
    --video_save_name Cosmos-1.0-Diffusion-7B-Text2World
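
We also noticed that the inference script appears to expose additional offload flags besides --offload_prompt_upsampler (we believe the README lists --offload_tokenizer, --offload_text_encoder_model, --offload_guardrail_models, and --offload_diffusion_transformer, but please correct us if we have the names wrong). Would a more aggressively offloaded run along the lines of the following sketch be the recommended workaround on 48 GB cards, or is there a proper multi-GPU path we should use instead?

PYTHONPATH=$(pwd) python cosmos1/models/diffusion/inference/text2world.py \
    --checkpoint_dir checkpoints \
    --diffusion_transformer_dir Cosmos-1.0-Diffusion-7B-Text2World \
    --prompt "$PROMPT" \
    --offload_prompt_upsampler \
    --offload_guardrail_models \
    --offload_text_encoder_model \
    --video_save_name Cosmos-1.0-Diffusion-7B-Text2World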

We believe six 48 GB GPUs should provide enough memory to run this model, yet the error shows only GPU 0 filling up.
Any suggestions for fixing this problem would be highly appreciated.