Triton Inference Server not loading YOLO11 models

I am trying to deploy multiple ONNX models on Triton Inference Server running on my Jetson AGX Orin.
Triton starts successfully, but no models are being loaded, even though the server itself is running fine.

Below is the log output:

This is my folder structure:

triton_models/
├── ppekit/
│   ├── config.pbtxt
│   └── 1/
│       └── model.onnx
├── gloves_shoes/
│   ├── config.pbtxt
│   └── 1/
│       └── model.onnx
└── ifr/
    ├── config.pbtxt
    └── 1/
        └── model.onnx
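
For anyone reproducing this, a config.pbtxt for one of these models looks roughly like the sketch below. This is only illustrative: the tensor names and dims are assumptions based on a default Ultralytics YOLO11 ONNX export (640x640, static batch of 1) and have to match what the actual model.onnx exposes. Recent Triton releases can also auto-complete most of these fields for ONNX models if they are omitted.

name: "ppekit"
platform: "onnxruntime_onnx"
max_batch_size: 0          # 0 = no Triton-managed batching; dims below include the batch dim
input [
  {
    name: "images"         # assumption: default input name of an Ultralytics YOLO11 ONNX export
    data_type: TYPE_FP32
    dims: [ 1, 3, 640, 640 ]
  }
]
output [
  {
    name: "output0"        # assumption: default output name of an Ultralytics YOLO11 ONNX export
    data_type: TYPE_FP32
    dims: [ 1, -1, -1 ]    # variable dims, since the class count differs per model
  }
]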

I followed this as a reference to deploy my models: Triton Inference Server with Ultralytics YOLO11

Please help me solve this issue and deploy my models correctly to Triton Inference Server.

You need to use the correct Docker image. For Jetson, you need the igpu Docker image:

nvcr.io/nvidia/tritonserver:24.09-py3-igpu

Also, what command did you use to start the Docker container? Did you pass --gpus? On Jetson you should pass only --runtime nvidia, not --gpus.
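
For reference, a typical launch on Jetson looks roughly like this. The host path ~/triton_models is an assumption; adjust it to wherever your model repository actually lives:

# use the NVIDIA container runtime (not --gpus) and mount the model repository
docker run -it --rm --runtime nvidia \
  -p 8000:8000 -p 8001:8001 -p 8002:8002 \
  -v ~/triton_models:/models \
  nvcr.io/nvidia/tritonserver:24.09-py3-igpu \
  tritonserver --model-repository=/models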

Hi @ffc0927e4460cf300fd61fb77 ,

This is not a TensorRT issue. You should post this on the NVIDIA Jetson Forums or the Triton Inference Server GitHub.

What I can see from the logs:
CUDA driver version is insufficient for CUDA runtime version

You are running Triton 24.09 (which requires CUDA 12/JetPack 6).
Reference: TensorFlow Release 24.09 - NVIDIA Docs
Your Jetson is likely still on JetPack 5 (which uses CUDA 11 by default).
See here: JetPack SDK | NVIDIA Developer.
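
If you want to confirm what the device is actually running, you can check directly on the Jetson (the nvidia-jetpack package and nvcc are only present if the JetPack components are installed):

# L4T / JetPack release string
cat /etc/nv_tegra_release

# JetPack meta-package version, if installed
apt-cache show nvidia-jetpack | grep Version

# CUDA toolkit version on the host, if installed
nvcc --version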

So downgrading to a Triton container release that is compatible with your JetPack version could work. I recommend checking the above forums/channels for the exact resolution.

I’ll keep this thread open for others to share their opinions.

Thank you.
