Output of batch inference

I’m trying to make my TensorRT object-detection program infer two images at the same time (batch inference). The inferred results from TensorRT are stored in an output array. When batchSize == 1 the program works well; however, when I set batchSize == 2, I only get correct results for the first image, and the results for the second image in the output array are all zeros. The following is a detailed description of my output.

If I set batchSize = 1, the size of the output array is 136459 and I can get the predicted bounding boxes. When I set batchSize = 2, the size of the output array is 272918 and I can get the predicted results of the first image at indices 0~136458. But the results of the second image, which should start at index 136459 of the array, are all zeros.

Snapshots of content of the output array:
https://drive.google.com/open?id=1ZOV4SInQVzLCG_n_-vwoVkeqrVzZfckX ; Results of the first image
https://drive.google.com/open?id=18YwUggoTjaEsS9WjAJ7QySCCp0heZna9 ; Results of the second image

Additional information:

  1. I’ve tried both mTrtContext->execute() and mTrtContext->enqueue(), and the inferred results are the same.
  2. Inference time if batchSize == 1: 18.5816 ms
  3. Inference time if batchSize == 2: 35.7012 ms

I want to ask:

  1. If my input array containing two images in a batch is correct, should the output after inference contain the bounding boxes of both images?
  2. Are the results inferred by mTrtContext->execute() or mTrtContext->enqueue() guaranteed to be correct?
  3. Is there a size limitation on the output when using batch inference in TensorRT? In other words, is an output array of size 272918 too big for TensorRT?

I’ve encountered the same issue, but with the Python API. Have you solved the problem? Thank you, and I hope for your reply.

Are you making sure to adjust the binding dimensions?

context->setBindingDimensions(binding_idx, nvinfer1::Dims4(batch_size, c, h, w));

I’ve tried it in the Python API before calling execute_async(), as follows:

context.set_binding_shape(0,trt.tensorrt.Dims([images.shape[0], 112, 112, 3]))

but the outputs are the same. By the way, only the batch size varies; the other dimensions stay fixed.

Hi, ethanyhzhang:
I’ve solved the issue. My TensorRT program is implemented in C++, and the model contains a custom layer that did not yet handle multiple images in a batch.

Please check the author’s repository to see how to process multiple images in custom layers:
https://github.com/lewes6369/tensorRTWrapper/blob/0aaab5110d0794c7c374c7f46fbde2050b459556/code/src/YoloLayer.cu

The keyword to look for is “batchSize”, which indexes over the multiple input images.

Thanks!