Hi,
I am building a Dockerized Triton inference server for a TensorRT model.
Base image: nvcr.io/nvidia/tritonserver:24.02-py3 or nvcr.io/nvidia/tritonserver:23.02-py3
For the model, I have a .engine file exported with Ultralytics (YOLOv8).
I am running this on an AWS g4dn.xlarge instance (NVIDIA T4 GPU).
Issues I Am Facing:
The container runs, but the API doesn't respond: the Triton server never comes up, so my inference script fails as well.
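To separate "container is up" from "Triton is actually serving", I check the health endpoints with the tritonclient Python package (a minimal sketch; localhost:8000 assumes the default HTTP port is published with -p 8000:8000):

import tritonclient.http as httpclient

# Health checks against Triton's default HTTP endpoint.
client = httpclient.InferenceServerClient(url="localhost:8000")
print("server live: ", client.is_server_live())
print("server ready:", client.is_server_ready())
print("model ready: ", client.is_model_ready("yolov8_tensorrt"))

If the server died during startup, the connection is refused outright instead of returning False.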
Here is my config.pbtxt:
name: "yolov8_tensorrt"
platform: "tensorrt_plan"
default_model_filename: "bestEHS.engine"
max_batch_size: 4
input [
{
name: "images"
data_type: TYPE_FP32
dims: [3, 640, 640]
}
]
output [
{
name: "output0"
data_type: TYPE_FP32
dims: [84, 8400]
}
]
instance_group [
{
kind: KIND_GPU
}
]
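For reference, a minimal repro of the inference call (tensor names, dtype, and dims are taken from the config above; random data stands in for a real preprocessed image, and the batch dimension is included because max_batch_size > 0):

import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

# Shape is [batch, 3, 640, 640]; the batch dim is implicit in config.pbtxt.
dummy = np.random.rand(1, 3, 640, 640).astype(np.float32)

inp = httpclient.InferInput("images", list(dummy.shape), "FP32")
inp.set_data_from_numpy(dummy)
out = httpclient.InferRequestedOutput("output0")

result = client.infer(model_name="yolov8_tensorrt", inputs=[inp], outputs=[out])
print(result.as_numpy("output0").shape)  # expect (1, 84, 8400)

In my case this never gets a response, because the server itself is not up.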
Folder structure:
yolov8-triton-tensorrt/
├── models/
│   └── yolov8_tensorrt/
│       ├── 1/
│       │   └── bestEHS.engine
│       └── config.pbtxt
├── bus.jpg
└── Dockerfile
The error I see points to a version mismatch during Triton server initialization. I have tried several other approaches as well, but most of them hit the same problem: the Triton server fails to initialize, or it never comes up and starts serving.
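Since a TensorRT .engine is only loadable by the same TensorRT version (and GPU architecture) it was built with, and the 24.02 and 23.02 Triton containers ship different TensorRT releases, my working assumption is a mismatch with the TensorRT version Ultralytics used for the export. A minimal sketch to test this inside the container (it assumes the tensorrt Python package is available there and that the model repository is mounted at /models):

import tensorrt as trt

print("TensorRT in this container:", trt.__version__)

# Try to deserialize the engine exactly as Triton would at load time.
logger = trt.Logger(trt.Logger.WARNING)
runtime = trt.Runtime(logger)

# Path assumes the model repository is mounted at /models.
with open("/models/yolov8_tensorrt/1/bestEHS.engine", "rb") as f:
    engine = runtime.deserialize_cuda_engine(f.read())

if engine is None:
    print("Deserialization failed -- likely built with a different TensorRT version.")
else:
    print("Engine deserialized OK; Triton should be able to load it.")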