TensorRT: maxBatchSize & batchSize / kFLOAT & kHALF / sampleUffMNIST.cpp

Hi,

  1. If I have a LeNet model in TensorFlow, I can set different batch sizes for training and for testing/inference.

I want to run “sampleUffMNIST.cpp” on a Jetson TX2 with “lenet5.uff”. How do I change the batch size for inference?

I understand that maxBatchSize is used to allocate the required memory, but I am not able to change batchSize.

  2. I do not see any significant difference in the average time taken when switching between (nvinfer1) kFLOAT and kHALF with the files and setup above. The result is the same when I run more images. Is there anything that I am missing?

Thank you.

Hi,

  1. The batch size is set here: context->execute(batchSize, &buffers[0])
  • setMaxBatchSize is used when building the TensorRT engine; it is the upper bound the engine reserves memory for.
  • The batch parameter of execute() is the actual batch size used at inference time; it can be any value from 1 up to maxBatchSize. (See the sketch after this list.)
  2. FP16 halves memory use, but it does not always double performance.
    Some layers (e.g., an inner-product layer) can even take longer to process in FP16 mode.
    We encourage you to compare the performance of FP16 and FP32 for your model.
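
Here is a minimal sketch of how the two settings relate, roughly following the structure of sampleUffMNIST (TensorRT 3.x-era API; exact calls such as setHalf2Mode may differ in your TensorRT version):

    // Sketch (hedged): maxBatchSize is fixed when the engine is built;
    // the batch passed to execute() at runtime can be anything <= maxBatchSize.
    IBuilder* builder = createInferBuilder(gLogger);
    INetworkDefinition* network = builder->createNetwork();
    parser->parse(uffFile, *network, nvinfer1::DataType::kFLOAT); // or kHALF for FP16 weights

    builder->setMaxBatchSize(128);  // upper bound; the engine reserves memory for this
    builder->setHalf2Mode(true);    // enable FP16 kernels (TensorRT 3.x name; needs FP16-capable GPU)
    ICudaEngine* engine = builder->buildCudaEngine(*network);

    // ... later, at inference time:
    int batchSize = 64;             // any value from 1 to maxBatchSize
    context->execute(batchSize, &buffers[0]);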

Thanks.

Hi,

Thank you for your response.

I will check FP16 (kHALF) performance on other models.

Regarding the batchSize:
I had already tried changing the batchSize at that location.
In the “void execute(ICudaEngine& engine)” function, I changed “int batchSize = 1” to “int batchSize = 128” (and other values).
This causes an assertion failure in the “void* createMnistCudaBuffer” function: “assert(eltCount == INPUT_H * INPUT_W)” fails,
because in the “calculateBindingBufferSizes” function, “eltCount = volume(dims) * batchSize”.
So eltCount grows with the batch size, while the assert expects it to equal the size of a single image.
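
For reference, the two pieces of the sample interact roughly like this (paraphrased from sampleUffMNIST.cpp, not a verbatim copy):

    // eltCount is computed per binding and already includes the batch size:
    int64_t eltCount = volume(dims) * batchSize;  // in calculateBindingBufferSizes()

    // ... but the input-buffer helper assumes a single image:
    void* createMnistCudaBuffer(int64_t eltCount, DataType dtype, int run)
    {
        assert(eltCount == INPUT_H * INPUT_W);    // fails as soon as batchSize > 1
        ...
    }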

How do I overcome this and proceed from here?

Thank you.

Hi,

If you want to run inference on a batch of 128 images at a time, you also need to prepare input and output buffers sized for N=128.

Input dimension: NxHxWxC
Output dimension: NxClass
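
As a hedged sketch, one way to do this is a batch-aware variant of the sample’s createMnistCudaBuffer (the batchSize parameter, the adjusted assert, and the file naming are assumptions for illustration; readPGMFile, safeCudaMalloc, elementSize, and CHECK are helpers from the sample):

    // Hypothetical batch-aware variant: batchSize images of INPUT_H x INPUT_W,
    // packed back to back in one buffer of eltCount elements.
    void* createMnistCudaBuffer(int64_t eltCount, DataType dtype, int batchSize)
    {
        assert(eltCount == batchSize * INPUT_H * INPUT_W); // was: eltCount == INPUT_H * INPUT_W
        assert(elementSize(dtype) == sizeof(float));

        size_t memSize = eltCount * elementSize(dtype);
        float* inputs = new float[eltCount];

        for (int b = 0; b < batchSize; ++b)
        {
            uint8_t fileData[INPUT_H * INPUT_W];
            readPGMFile(std::to_string(b % 10) + ".pgm", fileData); // assumed file naming
            for (int i = 0; i < INPUT_H * INPUT_W; ++i)
                inputs[b * INPUT_H * INPUT_W + i] = 1.0f - float(fileData[i]) / 255.0f;
        }

        void* deviceMem = safeCudaMalloc(memSize);
        CHECK(cudaMemcpy(deviceMem, inputs, memSize, cudaMemcpyHostToDevice));
        delete[] inputs;
        return deviceMem;
    }

The output buffer would then hold batchSize x 10 class scores, so the result-printing code would need to take an argmax over each group of 10 values, one group per image.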

Thanks.