TensorRT builder->setMaxBatchSize(maxBatchSize); question

rockking.jy · December 9, 2017, 1:17am

The TensorRT developer guide includes this call:

builder->setMaxBatchSize(maxBatchSize);

with explanation:
‣ maxBatchSize is the size for which the engine will be tuned. At execution time, smaller batches may be used, but not larger.

But I am not quite clear on this parameter. Could anyone help me understand it?

What does batch size mean here? Is it the same as the batch size in deep neural network training, or does it mean something else?

AastaLLL · December 11, 2017, 3:04am

Hi,

It is the same as the batch size you use in training.

TensorRT creates an inference engine at initialization.
The maximum batch size is required to allocate memory for the network.

Once the maximum batch size is set, you can launch the engine with any batch size <= that value.
Thanks.
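
The sizing rule described above can be sketched in plain C++. This is only an illustration and does not call TensorRT: `bufferBytes` and `isValidBatchSize` are hypothetical helpers, and the sketch assumes float data with a fixed number of elements per sample.

```cpp
#include <cstddef>

// Hypothetical helper (not a TensorRT API): bytes needed for one buffer
// when it is sized once for the largest batch the engine may receive.
// Assumes every element is a float.
std::size_t bufferBytes(std::size_t maxBatchSize, std::size_t elementsPerSample) {
    return maxBatchSize * elementsPerSample * sizeof(float);
}

// Hypothetical helper (not a TensorRT API): mirrors the rule that a run
// may use a smaller batch than maxBatchSize, but never a larger one.
bool isValidBatchSize(std::size_t batchSize, std::size_t maxBatchSize) {
    return batchSize >= 1 && batchSize <= maxBatchSize;
}
```

With maxBatchSize=8 and a (3, 224, 224) input, `bufferBytes(8, 3 * 224 * 224)` gives the input buffer size, and batch sizes 1 through 8 pass `isValidBatchSize` while 9 does not.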

rockking.jy · December 11, 2017, 3:39am

Actually, my question is this: for an image classification task, I need to classify images faster on the TX2. If I set maxBatchSize=8, will it be much faster than maxBatchSize=1?
If not, what exactly is maxBatchSize used for? As I understand it, batch size is only useful for stochastic gradient descent in training, not for inference or real-time use on the TX2.

rockking.jy · December 11, 2017, 3:42am

Is it possible to pass in, for example, 8 images at once and get back a vector of results? I mean “batch inference” in imageNet.

AastaLLL · December 12, 2017, 6:46am

Hi,

The input of TensorRT is in NCHW format.
N indicates the batch size of a network.

For example, N=8 means classifying eight images with one execute() call.
Speed, from fast to slow: one batch=1 call > one batch=8 call > eight batch=1 calls.

So if you need to classify eight images at a time (e.g., from 8 different input streams), you can launch TensorRT with batch=8 instead of making eight batch=1 calls to get better performance.

Thanks.
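
A minimal sketch of what "eight images in one call" means for the buffers, in plain C++ without any TensorRT calls. `packBatch` and `outputOffset` are hypothetical helpers. The sketch assumes each image is already a flat float array in CHW order, and that the batched output is one flat array with the scores of each image stored back to back; that output layout is an assumption, not something confirmed in this thread.

```cpp
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical helper: copy N images (each C*H*W floats in CHW order)
// back to back into one contiguous buffer. The batch index N becomes
// the slowest-varying dimension, which gives an NCHW layout.
std::vector<float> packBatch(const std::vector<std::vector<float>>& images,
                             std::size_t elementsPerImage) {
    std::vector<float> batch;
    batch.reserve(images.size() * elementsPerImage);
    for (const auto& image : images) {
        if (image.size() != elementsPerImage) {
            throw std::invalid_argument("image has an unexpected size");
        }
        batch.insert(batch.end(), image.begin(), image.end());
    }
    return batch;
}

// Hypothetical helper: index of the first score of image `imageIndex`
// in a flat output buffer (assumed layout: N blocks of numClasses floats).
std::size_t outputOffset(std::size_t imageIndex, std::size_t numClasses) {
    return imageIndex * numClasses;
}
```

Under these assumptions, eight (3, 224, 224) images become one buffer of 8 * 3 * 224 * 224 floats, and with 101 classes the scores of the last image would start at `outputOffset(7, 101)`.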

longzhu_71 · May 7, 2019, 7:34am

Hi,
I would like to know: when I set maxBatchSize to 8, what is the corresponding output shape?
For input (3, 224, 224) with maxBatchSize set to 1, the output is (1, 101).
For input (3, 224, 224) with maxBatchSize set to 8, my output is still (1, 101). Why is it not (8, 101)?

Thanks.

AastaLLL · May 9, 2019, 2:06am

Hi, longzhu_71

It looks like you have already filed another topic:
https://devtalk.nvidia.com/default/topic/1051423/jetson-tx2/tensorrt-5-builder-when-set-max_batch_size-to-8-the-output-shape-/

Let’s track this on the new topic directly.
Thanks.

kamatrohan13 · July 8, 2019, 11:19pm

So, if the training data is 1 million images, will the training run in batches of 8 (processed in parallel)?
I am new to TensorRT. Please help.

AastaLLL · July 23, 2019, 8:45am

Hi,

TensorRT is good for inference but is not recommended for training.
So you will need to check the training framework of your model for the details.

For example, you can set the batch size to 64, 128, 256, … in TensorFlow:
https://www.tensorflow.org/guide/datasets_for_estimators

Thanks.
