TensorRT for Large Language Models

Will this be available for Jetson Orin?

Yes, we’re trying that. You can refer to the thread “LLaMa 2 LLMs w/ NVIDIA Jetson and textgeneration-web-ui” in Jetson & Embedded Systems / Jetson Projects on the NVIDIA Developer Forums.