I have a DGX Spark and would like to run Cosmos Predict 2.5 inference.
Is there an example of how to do this?
Is there a docker container with cosmos predict 2.5 installed?
Thanks,
You can run Cosmos Predict 2.5 inference on DGX Spark either by using the official Docker workflow from the Cosmos docs or by cloning the GitHub repo and running the provided inference script. There is no "DGX-Spark-only" image; instead, you build and run a standard Cosmos Predict 2.5 container on Spark.
The inference guide in the repo is docs/inference.md, which walks through running base models for text, image, and video inputs. The standard workflow is to build and run the Cosmos Predict 2.5 container yourself, rather than pulling a prebuilt NVIDIA NGC image specifically for Spark.
Typical steps from the installation docs:
1. Clone the repo (path from docs):
   git clone https://github.com/nvidia-cosmos/cosmos-predict2.5.git
2. Build the image (with the --ulimit flag the docs recommend):
   docker build --ulimit nofile=131071:131071 -f Dockerfile . -t cosmos-predict-2.5
3. Run the container:
   docker run --gpus all --rm -v .:/workspace -v /workspace/.venv -it cosmos-predict-2.5
Inside the container you then follow the repo's setup and inference guides (environment variables, model download, and the command-line interface).
Keep an eye on where the Hugging Face cache lives (HF_HOME), and mount your working directory at /workspace when launching the container to keep I/O local to the DGX.
Would you prefer a concrete example command line (with typical flags and volumes) tailored to a single DGX Spark, or are you planning to run multi-node distributed inference?
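In the meantime, the clone, build, and run steps above can be collected into one script. This is only a minimal sketch, not an official NVIDIA script: the DRY_RUN switch, the run helper, and the explicit cd into the cloned directory are my own assumptions, added so the commands can be reviewed before anything is executed. The git and docker commands themselves are the ones quoted above.

```shell
#!/usr/bin/env bash
# Sketch: clone, build, and start the Cosmos Predict 2.5 container.
# By default this only PRINTS the commands; run with DRY_RUN=0 to execute them.
set -euo pipefail

REPO_URL="https://github.com/nvidia-cosmos/cosmos-predict2.5.git"
REPO_DIR="cosmos-predict2.5"    # assumption: local directory to clone into
IMAGE_TAG="cosmos-predict-2.5"  # image tag used in the steps above
DRY_RUN="${DRY_RUN:-1}"         # hypothetical switch: 1 = print only, 0 = execute

# Print the command in dry-run mode, otherwise execute it in the current shell.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

run git clone "$REPO_URL" "$REPO_DIR"
run cd "$REPO_DIR"  # assumption: the Dockerfile sits at the repo root
run docker build --ulimit nofile=131071:131071 -f Dockerfile . -t "$IMAGE_TAG"
run docker run --gpus all --rm -v .:/workspace -v /workspace/.venv -it "$IMAGE_TAG"
```

Saved as, say, cosmos_setup.sh (a hypothetical name), running it with bash cosmos_setup.sh lists the four commands, and DRY_RUN=0 bash cosmos_setup.sh actually executes them and drops you into the container shell. From there, follow docs/inference.md for the inference commands themselves.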
Thanks for the detailed instructions. I will try them later.
Do you have any idea about the performance of Predict 2.5 2B on DGX Spark in terms of speed?
Thanks,
No, sorry. I do not have any specific metrics like that.