|
PyTorch CUDACachingAllocator NVML assertion when sharing CUDA context with llama.cpp on Orin Nano 8 GB (JetPack 6.2.2)
|
|
2
|
15
|
May 14, 2026
|
|
ComfyUI setup optimized for DGX Spark
|
|
8
|
1280
|
May 11, 2026
|
|
Request a higher RPM in NVIDIA NIM (40 to 200)
|
|
0
|
39
|
May 9, 2026
|
|
AI Models That Run on Jetson Orin Nano Super (8GB) — A Practical Guide
|
|
6
|
2725
|
May 2, 2026
|
|
Rate Limit Increase Request: Academic Research on AI-Driven Zero Trust Architecture (ZTA)
|
|
2
|
23
|
April 29, 2026
|
|
ComfyUI: High-Performance Model Loading and DGX Spark Optimizations
|
|
3
|
398
|
April 26, 2026
|
|
Request to increase RPM limit for NIM API (Development)
|
|
5
|
292
|
April 17, 2026
|
|
Request for Rate Limit Increase for NIM API
|
|
2
|
111
|
April 16, 2026
|
|
[GB10] vLLM + DeepSeek-R1-32B Stable Setup on Blackwell — Full Protocol After 4 Days of Failures
|
|
4
|
434
|
March 17, 2026
|
|
Experiences running Qwen/Qwen3-Coder-Next?
|
|
11
|
1379
|
April 8, 2026
|
|
"unable to allocate CUDA0 buffer" after Updating Ubuntu Packages
|
|
245
|
15636
|
March 13, 2026
|
|
The Ollama journal shows "unable to find a kv cache slot"
|
|
6
|
128
|
March 12, 2026
|
|
Unable to run Nemotron AGX Thor Dev Kit
|
|
10
|
227
|
March 11, 2026
|
|
Running Qwen3.5 35B-A3B (MoE) on the Thor module's self-developed carrier board, the machine automatically powers down
|
|
5
|
249
|
March 24, 2026
|
|
Open-Jet: self-hosted Agentic TUI for air-gapped Jetsons
|
|
0
|
87
|
March 2, 2026
|
|
Benchmarking VLM on Orin
|
|
7
|
298
|
March 2, 2026
|
|
How to use Qwen3-ASR-0.6B on jetson orin nano?
|
|
2
|
308
|
March 23, 2026
|
|
SAM2 TensorRT engine produces different results from PyTorch on Jetson Orin Nano (JetPack 6.1)
|
|
4
|
142
|
February 28, 2026
|
|
VILA-1.5-13b-AWQ inference issue on Jetson AGX Orin via tinychat
|
|
2
|
34
|
February 23, 2026
|
|
Low ViT Performance Gain on Jetson Thor Using FP8 vs FP16
|
|
13
|
756
|
February 13, 2026
|
|
Ollama errors orin nano
|
|
43
|
2658
|
March 23, 2026
|
|
Llama3.2:3b randomly outputting "GGGGGGGG" when running under ollama on Jetson Orin Nano Super (JP6.2)
|
|
43
|
1192
|
February 25, 2026
|
|
How can I achieve a real-time 15fps performance using Nanoowl on Jetson Orin NX?
|
|
6
|
95
|
February 9, 2026
|
|
Tensort-RT LLM Support for Jetson
|
|
2
|
130
|
January 22, 2026
|
|
Qwen3-VL-4B fine-tune
|
|
2
|
369
|
January 22, 2026
|
|
DGX Spark Playbooks Update - Jan 2026
|
|
1
|
978
|
January 21, 2026
|
|
How Jetson allocate memory for GPU?
|
|
5
|
384
|
January 20, 2026
|
|
Help on llama.cpp command line arguments and compilation settings (performance testing included)
|
|
7
|
2193
|
January 9, 2026
|
|
vLLM 0.12.x Container for jetson Thor
|
|
4
|
255
|
January 8, 2026
|
|
使用comfyui生成视频时,提示这个错误
|
|
4
|
110
|
January 8, 2026
|