TensorRT-LLM is a high-performance LLM inference library with advanced quantization, attention kernels, and paged KV caching. Initial support for Jetson AGX Orin on JetPack 6.1 is included in the v0.12.0-jetson branch of the TensorRT-LLM repo.
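For example, to work from source you can fetch that branch directly (a minimal sketch; the branch name comes from above, and the wheel index URL is a placeholder you would take from the guides below):

```bash
# Shallow-clone only the Jetson support branch of the TensorRT-LLM repo
git clone -b v0.12.0-jetson --depth 1 https://github.com/NVIDIA/TensorRT-LLM.git
cd TensorRT-LLM

# Or, hypothetically, install the pre-compiled wheel instead of building
# from source -- substitute the actual index URL given in the documentation:
# pip install tensorrt_llm --extra-index-url <wheel-index-url-from-the-guides>
```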
We’ve made pre-compiled TensorRT-LLM wheels and containers available, along with these guides and additional documentation: