Hey everyone,
This is my first time posting here, and hope I keep doing so :)
I recently acquired an Asus Ascent GX10 and had to face its setup and configuration so it could serve LLM models for me. I work with a MacBook Pro, so I needed remote access.
Even though itβs a straightforward but repetitive process, it must be the same for anyone acquiring one of these magic boxes, so I decided to create spark.
spark is a single Bash script that handles the full DGX Spark lifecycle: initial setup, remote access, and model serving with vLLM β everything from your laptop.
GitHub: https://github.com/massimo92/spark
Install & setup
curl -fsSL https://raw.githubusercontent.com/massimo92/spark/main/install.sh | bash
spark setup # Guided wizard β configures your laptop AND the DGX over SSH
The setup wizard runs entirely from your Mac/Linux machine. It connects to the DGX via SSH and configures everything in one pass: system updates, GPU check, Docker, NGC auth, HuggingFace CLI, Tailscale for remote access, SSH keys, and the vLLM container.
Serve a model
spark pull RedHatAI/Qwen3.6-35B-A3B-NVFP4
spark run RedHatAI/Qwen3.6-35B-A3B-NVFP4
curl localhost:8000/v1/models
What it does differently
-
Auto-profiler β reads
config.jsonfrom the model and generates optimal vLLM flags automatically: reasoning parser (Qwen3, DeepSeek R1), tool-call parser, context length, multimodal detection, MoE architecture, and GPU memory utilization based on actual VRAM. -
Zero dependencies β single Bash script, no Python, no package manager. Works on any system with
bashandcurl. -
Remote-first β Tailscale integration so you can reach your DGX from anywhere. Setup disables password SSH after keys are configured.
-
Auto-update β checks for new spark CLI and NGC container versions once per day.
Available commands
spark setup # Guided wizard (runs from laptop, configures DGX over SSH)
spark run <model> # Serve a model with vLLM (auto-detects optimal flags)
spark stop # Stop the running model
spark pull <model> # Download from HuggingFace
spark list # List downloaded models with sizes
spark status # What's running
spark doctor # Check all prerequisites
spark update # Update NGC vLLM container
spark run supports --tools (tool calling), --text-only (skip vision encoder), --dry-run, --force, --tail, and manual overrides for memory/context/port.
Would love feedback from other DGX Spark owners. Did it take too long for you to configure it? Whatβs missing? Donβt hesitate to open a PR :)