TensorRT inference engine: is a bigger max_batch_size slower?

Hi all!

In my experiments, I found that a TensorRT inference engine initialized with a bigger max_batch_size runs slower than an engine initialized with a smaller max_batch_size.

For example, I have two engines: one initialized with max_batch_size = 32, call it engine_a, and the other initialized with max_batch_size = 1, call it engine_b. In my experiment, both engines run with batch_size = 1 at inference time. engine_a reaches 113 FPS (frames per second), while engine_b reaches 125 FPS, which means engine_a is about 10% slower than engine_b even though the two engines are identical except for the max_batch_size setting.
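
For context, this is roughly how the two engines are built. It is a minimal sketch using the implicit-batch TensorRT Python API that max_batch_size belongs to (deprecated since TensorRT 8); the ONNX model path and the workspace size are my assumptions, not details from the original setup:

```python
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

def build_engine(onnx_path, max_batch_size):
    # Implicit-batch build path (pre-TensorRT-8 API).
    builder = trt.Builder(TRT_LOGGER)
    network = builder.create_network()        # implicit batch dimension
    parser = trt.OnnxParser(network, TRT_LOGGER)
    with open(onnx_path, "rb") as f:
        parser.parse(f.read())
    builder.max_batch_size = max_batch_size   # the only difference
    builder.max_workspace_size = 1 << 30      # 1 GiB, an assumed value
    return builder.build_cuda_engine(network)

engine_a = build_engine("model.onnx", max_batch_size=32)  # hypothetical path
engine_b = build_engine("model.onnx", max_batch_size=1)
```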

Is this result normal?

In my application, the batch size varies at runtime, so I would normally set max_batch_size quite large (e.g. 32). But my experiment shows that such an engine runs slower, so it does not seem like a good idea.
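
For reference, a timing loop along these lines reproduces the batch-size-1 FPS measurement. It is a sketch only: it assumes pycuda for buffer management, a network with a single input binding (index 0) and a single output binding (index 1), and a 3×224×224 input, none of which come from the original post:

```python
import time
import numpy as np
import pycuda.autoinit  # noqa: F401 -- creates a CUDA context
import pycuda.driver as cuda

def measure_fps(engine, input_shape=(3, 224, 224), iters=1000):
    # Assumes one input binding (0) and one output binding (1).
    context = engine.create_execution_context()
    h_in = np.random.rand(1, *input_shape).astype(np.float32)
    h_out = np.empty((1, *engine.get_binding_shape(1)), dtype=np.float32)
    d_in, d_out = cuda.mem_alloc(h_in.nbytes), cuda.mem_alloc(h_out.nbytes)
    cuda.memcpy_htod(d_in, h_in)
    start = time.time()
    for _ in range(iters):
        # batch_size=1 at runtime, regardless of the engine's max_batch_size
        context.execute(batch_size=1, bindings=[int(d_in), int(d_out)])
    return iters / (time.time() - start)
```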

I found something similar. I am running benchmarks on high-end GPUs (P100, P40, V100), and what I found is that a batch size of 32 performs best (a bit better than 1, 8, or 16), while values like 64, 128, etc. perform far worse than 32.

Hello,

The max_batch_size parameter sets an upper bound on the batch size at runtime, and TensorRT optimizes the engine for that batch size. It does not mean that batch size will actually be used at runtime. In your experiment, both engines run with a batch size of 1 at runtime; engine_b therefore performs better, since TensorRT optimized it for a batch size of 1.
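
If the runtime batch size varies, the newer explicit-batch API lets you describe that to the builder with an optimization profile: TensorRT tunes kernels for the "opt" shape while still accepting anything between "min" and "max". A minimal sketch, where the tensor name "input", its dimensions, and the model path are assumptions:

```python
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(TRT_LOGGER)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, TRT_LOGGER)
with open("model.onnx", "rb") as f:   # hypothetical model path
    parser.parse(f.read())

config = builder.create_builder_config()
profile = builder.create_optimization_profile()
# Kernels are tuned for the "opt" shape, so point it at the
# batch size you actually run most often (here, 1).
profile.set_shape("input",            # assumed input tensor name
                  (1, 3, 224, 224),   # min
                  (1, 3, 224, 224),   # opt
                  (32, 3, 224, 224))  # max
config.add_optimization_profile(profile)
engine = builder.build_engine(network, config)
```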

Thanks.
NVIDIA ENTERPRISE SUPPORT