I am using this code for inference, which is available on the Ultralytics site:
import os

import cv2
from tritonclient.http import InferenceServerClient  # pip install tritonclient[http] if needed
from ultralytics import YOLO

# Set up the Triton inference client
triton_client = InferenceServerClient(url="localhost:8000", verbose=False, ssl=False)

# Load the Triton Server model
model = YOLO("http://localhost:8000/yolov8n", task="detect")

directory = "pathtodirectory"
for filename in os.listdir(directory):
    video_path = os.path.join(directory, filename)
    cap = cv2.VideoCapture(video_path)

    # Loop through the video frames
    while cap.isOpened():
        # Read a frame from the video
        success, frame = cap.read()
        if success:
            # Run YOLOv8 inference on the frame
            results = model(frame)

            # Visualize the results on the frame
            annotated_frame = results[0].plot()

            # Display the annotated frame
            cv2.imshow("YOLOv8 Inference", annotated_frame)

            # Break the loop if 'q' is pressed
            if cv2.waitKey(1) & 0xFF == ord("q"):
                break
        else:
            # Break the loop if the end of the video is reached
            break

    # Release the video capture object
    cap.release()

# Close the display window
cv2.destroyAllWindows()
If I export the model with the default settings, i.e. model.export(format="onnx"), then inference works fine and I get results for any input size (the default is imgsz=640, as specified by Ultralytics).
But if I export the model with a specified size, model.export(format="onnx", imgsz=800), and then run it on the Triton server, it gives this error:
tritonclient.utils.InferenceServerException: [400] [request id: <id_unknown>] unexpected shape for input 'images' for model 'yolov8n'. Expected [1,3,800,800], got [1,3,640,640]
For config.pbtxt I am using:
name: "yolov8n"
platform: "tensorrt_plan"
max_batch_size: 0
input [
  {
    name: "images"
    data_type: TYPE_FP32
    dims: [1, 3, 800, 800]
  }
]
output [
  {
    name: "output0"
    data_type: TYPE_FP32
    dims: [1, 84, 13125]
  }
]
instance_group [
  {
    kind: KIND_GPU,
    count: 1
  }
]
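As a sanity check on the output dims in that config: YOLOv8's detection head predicts on feature maps at strides 8, 16, and 32, so the 13125 is exactly what a square imgsz=800 input produces (and 8400 would be the count for the default 640). A small sketch of that arithmetic:

```python
def yolov8_num_predictions(imgsz: int) -> int:
    """Number of prediction cells across YOLOv8's three detection strides."""
    return sum((imgsz // stride) ** 2 for stride in (8, 16, 32))


print(yolov8_num_predictions(800))  # 13125 -> matches dims [1, 84, 13125]
print(yolov8_num_predictions(640))  # 8400, for the default imgsz
```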
Any solution to this problem?