In TRT 5, I serialize the engine with trtexec from the shell:
./trtexec --onnx=example.onnx --saveEngine=example.trt --fp16 --batch=5
Then I deserialize example.trt in my C++ project and run inference with a variable batch size; the inference time is roughly proportional to the batch size.
For example, running inference twice within one program lifetime: the first run with batch size 1 takes 10 ms, and the second run with batch size 2 takes 20 ms.
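For context, my TRT 5 flow looks roughly like this minimal sketch (the per-sample input/output element counts are placeholders for the real bindings of example.onnx; error handling and host/device copies are omitted):

```cpp
#include "NvInfer.h"
#include <cuda_runtime_api.h>
#include <cstdio>
#include <fstream>
#include <iterator>
#include <vector>

using namespace nvinfer1;

struct Logger : public ILogger {
    void log(Severity severity, const char* msg) override {
        if (severity <= Severity::kWARNING) std::printf("%s\n", msg);
    }
} gLogger;

int main() {
    // Deserialize the engine built with --batch=5 (so maxBatchSize == 5).
    std::ifstream file("example.trt", std::ios::binary);
    std::vector<char> blob((std::istreambuf_iterator<char>(file)),
                           std::istreambuf_iterator<char>());
    IRuntime* runtime = createInferRuntime(gLogger);
    ICudaEngine* engine =
        runtime->deserializeCudaEngine(blob.data(), blob.size(), nullptr);
    IExecutionContext* context = engine->createExecutionContext();

    // Buffers sized for the maximum batch; the per-sample sizes below
    // are placeholders, not the real shapes of example.onnx.
    int maxBatch = engine->getMaxBatchSize();
    void* bindings[2];
    cudaMalloc(&bindings[0], maxBatch * 3 * 256 * 256 * sizeof(float));
    cudaMalloc(&bindings[1], maxBatch * 1000 * sizeof(float));

    // Implicit batch: the batch size is an argument of execute(), so each
    // call only processes that many samples (batch 1 ~10 ms, batch 2 ~20 ms).
    const int batches[] = {1, 2};
    for (int batch : batches) {
        context->execute(batch, bindings);
    }

    cudaFree(bindings[0]);
    cudaFree(bindings[1]);
    context->destroy();
    engine->destroy();
    runtime->destroy();
    return 0;
}
```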
In TRT 7.1, I serialize the engine with:
./trtexec --onnx=example.onnx --saveEngine=example.trt --minShapes=input:1x3x128x128 --optShapes=input:4x3x256x256 --maxShapes=input:5x3x256x256
For example, when I run inference twice with TRT 7.1, the first run with batch size 1 takes 20 ms, and the second run with batch size 2 also takes 20 ms.
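My TRT 7.1 flow, again as a minimal sketch (binding index 0 is assumed to be `input`; the output size is a placeholder): the batch is now part of the binding dimensions, which I set on the context before each call.

```cpp
#include "NvInfer.h"
#include <cuda_runtime_api.h>
#include <cstdio>
#include <fstream>
#include <iterator>
#include <vector>

using namespace nvinfer1;

struct Logger : public ILogger {
    void log(Severity severity, const char* msg) override {
        if (severity <= Severity::kWARNING) std::printf("%s\n", msg);
    }
} gLogger;

int main() {
    std::ifstream file("example.trt", std::ios::binary);
    std::vector<char> blob((std::istreambuf_iterator<char>(file)),
                           std::istreambuf_iterator<char>());
    IRuntime* runtime = createInferRuntime(gLogger);
    ICudaEngine* engine =
        runtime->deserializeCudaEngine(blob.data(), blob.size(), nullptr);
    IExecutionContext* context = engine->createExecutionContext();

    // Buffers sized for the maximum profile shape 5x3x256x256; the output
    // element count is a placeholder for the real output binding.
    void* bindings[2];
    cudaMalloc(&bindings[0], 5 * 3 * 256 * 256 * sizeof(float));
    cudaMalloc(&bindings[1], 5 * 1000 * sizeof(float));

    // Explicit batch: the batch is the leading binding dimension and must
    // be set before each inference, yet both calls take ~20 ms for me.
    const int batches[] = {1, 2};
    for (int batch : batches) {
        context->setBindingDimensions(0, Dims4{batch, 3, 256, 256});
        context->executeV2(bindings);
    }

    cudaFree(bindings[0]);
    cudaFree(bindings[1]);
    context->destroy();
    engine->destroy();
    runtime->destroy();
    return 0;
}
```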
I'm curious whether it is possible in TRT 7.1 to do variable-batch inference like in TRT 5, where the time consumed grows proportionally with the batch size?