Jetson Container text-generation-webui not loading models

esteban.gallardo · May 22, 2024, 5:26pm

I’m developing an app that use LLM Endpoints to request for data. I’m trying to use the jetson container text-generation-webui just as described here:

https://www.jetson-ai-lab.com/tutorial_text-generation.html

Unfortunately I cannot load any model, no matter what model, no matter the model loader. See image:

I’ve been working successfully with stable-diffusion-webui container and I would like to be able to do the same with text-generation-webui

The specs of my Jetson Orin AGX are:

Any help would be appreciated.

dusty_nv · May 22, 2024, 5:59pm

Hi @esteban.gallardo, it looks like you are trying to load a GPTQ model with the llama.cpp loader - llama.cpp only works with GGUF models. IIRC, text-generation-webui expects the .gguf model file to be saved under it’s model directory (/data/models/text-generation-webui)

By the way, for llama.cpp loader you should increase the n-gpu-layers setting (typically to the max), otherwise it will not use GPU.

esteban.gallardo · May 23, 2024, 8:48am

Unfortunatelly that didn’t work.

kayccc · June 5, 2024, 5:27am

Continues the discussion at Enabling API for jetson container AudioCraft - Jetson & Embedded Systems / Jetson AGX Orin - NVIDIA Developer Forums

system · June 19, 2024, 5:28am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Problems with "Tutorial - text-generation-webui" Jetson Orin Nano generative_ai	6	248	February 24, 2025
Couldn't find a compatible container for text-generation-webui Jetson AGX Orin containers , generative_ai	11	186	January 23, 2025
Performance Issues with LLM model on NVIDIA Jetson Orin NX (16GB) Jetson Orin NX generative_ai	2	942	June 13, 2024
Want to run a Local LLM on Nvidia Jetson AGX Orin Jetson AGX Orin generative_ai	3	2810	July 17, 2024
Unable to Utilize GPU for LLM on NVIDIA Jetson AGX Orin Jetson AGX Orin generative_ai	4	212	July 4, 2024
Jetpack6 llamacpppython Jetson AGX Orin generative_ai , llama	5	346	January 28, 2025
How to set n_gqa for loading 70B model in text-generation-webui Jetson AGX Orin jetson-inference , generative_ai	2	576	January 2, 2024
Jetson Generative AI Playground - Tutorial 1 - Text Generation Jetson Orin Nano generative_ai	5	867	September 27, 2023
Unable to Utilize GPU for LLM on NVIDIA Jetson AGX Orin Jetson AGX Orin generative_ai	4	207	July 4, 2024
TensorRT-LLM for Jetson Jetson AGX Orin generative_ai	10	1876	April 21, 2025

Jetson Container text-generation-webui not loading models

Related topics