llava-llama-2-13b-chat-lightning-gptq through oobabooga: RAM usage went from 14.17GB → 20.39GB (a 6.22GB increase), but that seems low, so take it with a grain of salt. That was after querying it on an image a few times. I think it should still run in 16GB, though.
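If you want to reproduce that kind of before/after number yourself, here's a minimal sketch of reading used system RAM from `/proc/meminfo` (the `used_ram_gb` helper is my own, not something from oobabooga):

```python
def used_ram_gb():
    """Return used system RAM in GB, computed from /proc/meminfo.

    'Used' here is MemTotal - MemAvailable, which is roughly what
    tools like free(1) report as in-use memory.
    """
    info = {}
    with open("/proc/meminfo") as f:
        for line in f:
            key, value = line.split(":", 1)
            info[key] = int(value.split()[0])  # values are in kB
    used_kb = info["MemTotal"] - info["MemAvailable"]
    return used_kb / (1024 ** 2)  # kB -> GB

before = used_ram_gb()
# ... load the model and run a few image queries here ...
after = used_ram_gb()
print(f"{before:.2f}GB -> {after:.2f}GB (+{after - before:.2f}GB)")
```

Sample before loading, again after a few queries, and take the difference. Note this only counts CPU-side RAM; on Jetson, GPU allocations also come out of the same unified memory, so the delta should capture most of the model's footprint.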