How to enable dynamic batching for models on Triton Inference Server


I am hosting a custom model on Triton Inference Server and trying to enable dynamic batching for it.

I am converting a PyTorch model to ONNX and enabling dynamic batching on the input and output nodes.

I have attached the script I use to convert from PyTorch to ONNX, along with my config.pbtxt file.

Relevant Files (683 Bytes)
config.pbtxt (270 Bytes)

Error Encountered

```
E0522 09:23:57.598550 82] failed to load 'par' version 1: Invalid argument: model 'par', tensor '512': for the model to support batching the shape should have at least 1 dimension and the first dimension must be -1; but shape expected by the model is [1,22]
```
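The error indicates that when `max_batch_size` is greater than zero, Triton expects the model's tensors to have a variable (-1) first dimension, but the ONNX graph reports a fixed shape of [1,22] for tensor '512'. For comparison, a minimal config.pbtxt with dynamic batching might look like the sketch below; the tensor names, data type, and dims are assumptions inferred from the error message, not my actual config:

```
name: "par"
platform: "onnxruntime_onnx"
max_batch_size: 8
input [
  {
    name: "input"
    data_type: TYPE_FP32
    dims: [ 22 ]
  }
]
output [
  {
    name: "512"
    data_type: TYPE_FP32
    dims: [ 22 ]
  }
]
dynamic_batching { }
```

Note that with `max_batch_size > 0`, the batch dimension is implicit and must not be listed in `dims`; Triton prepends it automatically, but the underlying ONNX model still has to accept a variable first dimension for this to load.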