ONNX Runtime-GenAI

πŸš€ ONNX Runtime-GenAI: Now Dockerized for Effortless Deployment! πŸš€

We’re excited to announce that the ONNX Runtime-GenAI plugin is now fully dockerized, simplifying deployment and usage for developers working on NVIDIA Jetson Orin devices. Thanks to the recent pull request #767 by @dusty, this cutting-edge extension to ONNX Runtime is now available through prebuilt Docker containers.

πŸ’‘ Why This is a Game-Changer:

  • Plug-and-play setup: Skip the manual compilation; the Docker container has everything pre-configured.
  • Optimized for Jetson Orin: Leverages the device’s hardware acceleration, making it easy to drop into edge AI projects.
  • Supports GenAI Workloads: Perfect for generative AI tasks requiring high efficiency and low latency.

πŸŽ‰ A Special Thanks to @shahizat
Your insightful guide and contributions to the NVIDIA Developer Forums have paved the way for using this library effectively on Jetson Orin. Check out their foundational work here: Running Phi 3.5 Vision Using ONNX Runtime-GenAI.


🌐 How to Get Started:

  • Pull the container and explore ONNX Runtime-GenAI effortlessly.
  • Dive into GitHub PR #767 for more details.
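The steps above can be sketched with the jetson-containers helper tooling. This is a minimal sketch under assumptions: it assumes the standard jetson-containers workflow (`install.sh` plus `autotag`), and the package name `onnxruntime_genai` is a guess — check PR #767 for the exact container name published for your JetPack release.

```shell
# One-time setup: clone the jetson-containers repo and install its CLI helpers
git clone https://github.com/dusty-nv/jetson-containers
bash jetson-containers/install.sh

# Pull (or build) and run the container; autotag resolves an image tag
# matching your JetPack/L4T version. The package name 'onnxruntime_genai'
# is assumed here -- see PR #767 for the exact name.
jetson-containers run $(autotag onnxruntime_genai)
```

From inside the running container, the library is pre-installed, so no manual compilation is needed before experimenting with generative AI workloads.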

Join the edge AI revolution with ONNX Runtime-GenAI! πŸš€


Awesome @johnnynunez, thanks for demonstrating excellent teamwork! 😊
