I was wondering how I can run Triton Inference Server on my Jetson Orin device. Is there a Docker image already available? From what I have understood so far, Triton is not yet available for Jetson devices. Please help me with this; I want to run Triton on my Jetson Orin.
Hi,
Do you use AGX Orin or Orin Nano? This topic is in the Orin Nano category, but it looks like you are asking about AGX Orin. I would like to confirm which platform you are using.
Hi,
Please check below:
Thanks.
Hi, I just verified it's an AGX Orin. I want to use Triton Inference Server on it. Can I directly pull a Docker image? From what I checked, there are very few to no docs about Triton on Jetson.
Hi, thanks for the reply. But I want to use Triton Inference Server on the AGX Orin. I want to run multiple models (five deep learning inferences) in parallel on the AGX Orin.
Hi,
You can start with the tutorial below:
Thanks.
I think DeepStream is limited to multi-object detection, multi-image classification, and segmentation. But I want to run any model in parallel. Let's say I have machine learning or deep learning models of any kind (speech enhancement, super resolution, etc.). I want to run multiple models, of any sort, in parallel on my AGX Orin device.
Hi,
You can refer to the Triton documentation for examples:
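For what it's worth, the parallelism you are asking about is configured per model inside the Triton model repository. Below is a minimal config.pbtxt sketch; the model name, backend, and tensor shapes are made-up placeholders, not taken from the Triton docs:

```
# config.pbtxt for one (hypothetical) model in the model repository
name: "speech_enhancer"
platform: "onnxruntime_onnx"
max_batch_size: 8

input [
  {
    name: "audio_in"
    data_type: TYPE_FP32
    dims: [ 16000 ]
  }
]
output [
  {
    name: "audio_out"
    data_type: TYPE_FP32
    dims: [ 16000 ]
  }
]

# Run two instances of this model on the GPU so its requests can
# execute concurrently.
instance_group [
  {
    count: 2
    kind: KIND_GPU
  }
]

# Optionally batch queued requests together for throughput.
dynamic_batching { }
```

Each model gets its own subdirectory and config like this in the repository, and Triton loads and schedules them independently, which is what lets several different models run in parallel on one device.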
Thanks.
Hi, thanks for the reply.
I don't think you are getting me. I need parallel model inference on the AGX Orin. I already mentioned that Triton provides a dynamic batching feature for parallel model inference. But JetPack versions don't have Triton support. I need a solution that uses Triton (or an alternative) to run multiple models in parallel at the same time. Triton for JetPack doesn't have Docker images. So do you have any inputs on this?
Hi @utkarsh.tiwariblr47, check the Triton releases page on GitHub for the Jetson releases:
See here for the containers:
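In case it helps, the Jetson builds on that releases page ship as a tarball rather than a container, so the install is roughly the following. This is only a sketch; the asset name and version are placeholders, and each release's notes list the runtime dependencies to install first:

```bash
# Grab the JetPack release tarball from the GitHub releases page.
# File and version names below are placeholders -- copy the exact
# asset name for your JetPack version from the release you pick.
wget https://github.com/triton-inference-server/server/releases/download/v2.xx.0/tritonserver2.xx.0-jetpackX.X.tgz

# Unpack it and start the server against a local model repository.
mkdir -p tritonserver
tar -xzf tritonserver2.xx.0-jetpackX.X.tgz -C tritonserver
./tritonserver/bin/tritonserver --model-repository=/path/to/model_repository
```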
Thanks for your response.
If I follow the installation steps on the page mentioned, do I need to do any configuration after the installation?
Second question:
On that page it is mentioned: "For Jetson devices which support Jetpack 6.0 and above, Triton now publishes containers, based on the latest version of Jetpack, on NGC with the suffix -igpu."
So does that mean I can simply pull the container image from the following link? Would that avoid manual installation of Triton Inference Server on Jetson?
Hi,
Yes, please use the container with the -igpu tag and it should have Triton preinstalled.
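For completeness, pulling and running the iGPU container follows the usual NGC flow. A sketch, where the tag is a placeholder for whichever current -igpu tag is listed on NGC:

```bash
# Pull the Jetson (iGPU) build of the Triton server image from NGC.
# "24.xx-py3-igpu" is a placeholder -- use the current -igpu tag on NGC.
docker pull nvcr.io/nvidia/tritonserver:24.xx-py3-igpu

# Run it with the NVIDIA container runtime and your model repository mounted.
docker run --rm --runtime nvidia --network host \
  -v /path/to/model_repository:/models \
  nvcr.io/nvidia/tritonserver:24.xx-py3-igpu \
  tritonserver --model-repository=/models
```

With the models from your repository loaded, the same instance_group / dynamic_batching settings discussed above control how many of them execute in parallel.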
Thanks.