|
[Help Needed] Building vLLM dependencies inside SGLang official image for Eagle-3 Speculative Decoding
|
|
8
|
34
|
December 5, 2025
|
|
Jetson Orin Nano Super Developer Kit throttles on Gemma 3:4b
|
|
5
|
36
|
December 4, 2025
|
|
How to Run vLLM ≥ 0.11.0 on Jetson AGX Orin?
|
|
7
|
43
|
December 4, 2025
|
|
Unsloth Finetuning Playbook - Fine-tuning GPT-OSS-20B with GB10 Forum Data
|
|
0
|
119
|
November 27, 2025
|
|
Certificate verify failed while installing NIM models
|
|
0
|
21
|
November 26, 2025
|
|
How to extract nested loop features from CUDA kernels for LLM-based optimization?
|
|
0
|
30
|
November 25, 2025
|
|
DGX Spark: The Sovereign AI Stack — Dual-Model Architecture for Local Inference
|
|
0
|
133
|
November 22, 2025
|
|
Deploy Qwen2.5-VL-7B via TensorRT-LLM in Jeston Orin
|
|
2
|
61
|
November 21, 2025
|
|
Request for suitable vLLM Docker for Jetson AGX Orin with CUDA 12.6
|
|
2
|
39
|
November 19, 2025
|
|
5x Build Failure: jetson-containers PyTorch for nano_llm on Orin Nano (Exit Status 1)
|
|
1
|
49
|
November 19, 2025
|
|
TRT LLM for Inference with NVFP4 safetensors slower than LM studio GGUF on the Spark
|
|
5
|
370
|
November 15, 2025
|
|
Gen AI Benchmarking: LLMs and VLMs on Jetson
|
|
7
|
74
|
November 5, 2025
|
|
Orin Nano Qwen3-VL-4B
|
|
7
|
442
|
November 3, 2025
|
|
Build "Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation" on DGX Spark
|
|
2
|
168
|
November 1, 2025
|
|
My Jetson AGX Orin 64GB Developer Kit GPU Don't Dectected and Nvidia-smi can't dectected GPU and jetpack not dectectd
|
|
3
|
34
|
October 31, 2025
|
|
Pre-training Nanochat on DGX Spark (Standalone and Clustered mode)
|
|
0
|
242
|
October 30, 2025
|
|
Install ComfyUI on Jetson AGX Orin 32gb - Ubuntu 20.04 / JP5.1.5 / L4T 35.6.3 / CUDA 11.4
|
|
5
|
73
|
October 30, 2025
|
|
Unable to load large models on Jetson Orin Nano Super despite sufficient RAM
|
|
6
|
163
|
October 28, 2025
|
|
Jetson Agx thor
|
|
17
|
194
|
October 27, 2025
|
|
tensorrt推理Qwen/Qwen3-VL-8B-Instruct不兼容
|
|
4
|
125
|
October 27, 2025
|
|
LLM inference results?
|
|
3
|
64
|
October 27, 2025
|
|
Open Voice OS on Jetson Orin Nano: Offline AI Assistant with LLM + TTS + STT on K3s!
|
|
2
|
1661
|
October 24, 2025
|
|
VILA1.5-3b (MLC/nano_llm) Fails to Output Strict 'YES'/'NO' Format on Jetson Orin Nano
|
|
4
|
64
|
October 9, 2025
|
|
The token speed of LLM on Jetson AGX Orin
|
|
5
|
272
|
October 22, 2025
|
|
Request for Lab Extension – Building LLM Applications With Prompt Engineering
|
|
3
|
84
|
July 14, 2025
|
|
LLM video Series Multimodal Rag, Building Multimodal AI RAG with LlamaIndex, NVIDIA NIM, and Milvus | LLM App Development
|
|
0
|
102
|
July 8, 2025
|
|
Adapt a new model with a structure similar to LLaMA3
|
|
1
|
48
|
June 30, 2025
|
|
NanoLLM - Video querying multiple streams simultaneously
|
|
2
|
75
|
June 17, 2025
|
|
Upcoming Webinar and Livestream - Supercharge AI Agents with Data Flywheels
|
|
0
|
120
|
June 10, 2025
|
|
Deploying Triton Server with TensorRT-LLM on Jetson AGX Orin (JetPack 6.2) — Any Working Example?
|
|
10
|
699
|
June 17, 2025
|