Want to run a local LLM on an NVIDIA Jetson AGX Orin

I am looking to run a local LLM (large language model) on an NVIDIA Jetson AGX Orin using the GPU's CUDA cores. Could anyone provide guidance or share resources on how to achieve this?

I was able to run a local LLM (a .gguf model) on the CPU, but I haven't been able to get it to use the GPU.
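For reference, this is roughly what I am running (a minimal sketch using the llama-cpp-python bindings; the model path is a placeholder). It generates fine, but everything stays on the CPU even with n_gpu_layers set:

```python
from llama_cpp import Llama

# Placeholder path -- substitute your own quantized GGUF model.
MODEL_PATH = "/data/models/llama-2-7b-chat.Q4_K_M.gguf"

# n_gpu_layers=-1 asks llama.cpp to offload all layers to the GPU,
# but this only takes effect if the library was built with CUDA;
# a stock pip wheel is CPU-only, so the setting is silently ignored.
llm = Llama(model_path=MODEL_PATH, n_gpu_layers=-1, verbose=True)

out = llm("Q: What is the capital of France? A:", max_tokens=32)
print(out["choices"][0]["text"])
```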

Thank you in advance for your help!

You may check the Jetson AI Lab - Home Assistant Integration topic in the Jetson Projects section of the NVIDIA Developer Forums.

Hi @mausam.jain, we provide containers for llama.cpp, ollama, and text-generation-webui, all built with CUDA enabled in llama.cpp: https://github.com/dusty-nv/jetson-containers
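As a quick sanity check, you can ask the bindings whether the llama.cpp build underneath was actually compiled with GPU support (a sketch against llama-cpp-python's low-level API; the exact helper name may differ between versions):

```python
import llama_cpp

# True only when the underlying llama.cpp library was built with a GPU
# (e.g. CUDA) backend; a CPU-only build returns False, in which case
# n_gpu_layers is silently ignored.
print("GPU offload supported:", llama_cpp.llama_supports_gpu_offload())
```

Inside the CUDA-enabled containers this should print True, and the verbose load log should report layers being offloaded to the GPU.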

We also have ollama and oobabooga tutorials on Jetson AI Lab (https://www.jetson-ai-lab.com/) that walk through running quantized GGUF models.
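Once the ollama container and server are up, a quick smoke test from Python looks like this (a sketch assuming the `ollama` pip package is installed and a model has already been pulled; the model name is a placeholder):

```python
import ollama

# Talks to the local ollama server on its default port (11434). The model
# must already be pulled via the ollama CLI; "llama2" is a placeholder.
response = ollama.chat(
    model="llama2",
    messages=[{"role": "user", "content": "Say hello from the Jetson."}],
)
print(response["message"]["content"])
```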
