NVIDIA Developer Forums

OptiLLM

eparin82 May 15, 2026, 12:29pm 1
