Input batch size is smaller than TensorRT engine batch size

srsjd · March 18, 2022, 2:49am

I found that TensorRT model wtih batch-size of 6 can be used to infer an input with batch-size less than such as 4. My question is that would it be more apporiate to use a TensorRT engine with batch-size of 4 to infer on a 4 batch input? What difference does it make?

My setup is the following:

Jetson Xavier
DeepStream 5.0
JetPack 4.4
TensorRT 7.1.3
NVIDIA GPU Driver Version 10.2

spolisetty · March 28, 2022, 4:47pm

Hi,

Dynamic shape means the dimension can change in a range [min, max].
So here optimization profile is used to tell TensorRT the min/max/opt value to build the engine.

Thank you.

Topic		Replies	Views
Question about tensorRT batch size DeepStream SDK tensorrt	2	881	October 12, 2021
why batchsize is larger the per-image inference time is faster for a specific input size? TensorRT	1	887	February 25, 2020
tensorRT inference engine that setting bigger max_batch_size is slower? TensorRT	3	843	October 12, 2021
How can I use the dynamic shape of tensorrt ？ TensorRT	3	2197	April 8, 2021
TensorRT builder->setMaxBatchSize(maxBatchSize); question Jetson TX2	9	6489	October 18, 2021
about working with dynamic shapes TensorRT	5	1138	January 9, 2020
How to support dynamic batch size for TensorRT engine? TensorRT	1	1091	March 3, 2023
Is there any easy way to easily get Determinism for different batch size inferring? TensorRT	1	364	June 29, 2022
Input size config with performances DeepStream SDK tensorrt , performance	3	514	April 24, 2023
Question about Trt7.1 variable batch inference Jetson Xavier NX tensorrt	4	369	October 18, 2021

Input batch size is smaller than TensorRT engine batch size

Related topics