TensorRT Engine batch inference only has one result

Wenbin_Xu · June 3, 2020, 8:42am

Description

I used onnx2trt to generate engine file with batchsize 1. And the inference result is correct. Then I tried setting batchsize to 64 and generate a new engine and do inference. But the output is the same as before. My model is a classification network with 10 classes. With batchsize of 64, I’d expect the output to be of shape (64, 10) and all 64 outputs are the same. However, only the (1, 10) element is correct, the rest 63 are all 0.

This is the inference code I used: https://drive.google.com/file/d/1msXAYG9IbIxY1sLZyXdR52ZAtBSBN161/view?usp=sharing

Can anyone take a look and tell me what is wrong with it? Thanks.

Environment

freshly installed Jetpack 4.4 DP

SunilJB · June 3, 2020, 7:45pm

Can you share the sample script and model file to reproduce the issue so we can help better?

Thanks

Wenbin_Xu · June 4, 2020, 12:42am

Thank you for your reply.

The python code I used is here: test.py - Google Drive

And the trt engine is here: cls.trt - Google Drive

SunilJB · June 4, 2020, 3:54am

The generated plan files are not portable across platforms or TensorRT versions.
Could you please share the ONNX file as well so that i can generate the trt engine using that model file?

Thanks

Wenbin_Xu · June 4, 2020, 8:09am

Sorry I forgot about it.

Here is the onnx model: cls.onnx - Google Drive

Thanks.

Wenbin_Xu · June 8, 2020, 6:53am

@SunilJB Hi, is there any update on this issue? Thanks.

SunilJB · June 8, 2020, 10:30am

We are looking into it, will update you accordingly.

Thanks

SunilJB · June 9, 2020, 6:29pm

Since TRT >= 7 requires EXPLICIT_BATCH for ONNX, for fixed-shape model, the batch size is fixed.
You have to use a dynamic shape model in this case.
Please find below link with a minimal example of exporting Alexnet from PyTorch with dynamic batch size here: https://gist.github.com/rmccorm4/b72abac18aed6be4c1725db18eba4930

Thanks

Wenbin_Xu · June 10, 2020, 1:49am

Thank you for your update.

After generating the dynamic batch size onnx model, the onnx2trt tool cannot parse the onnx model to trt engine.

What I tried afterwards is when exporting onnx model, use dummy data with explicit batchsize, (in my case, 64x3x192x48 instead of 1x3x192x48). The exported onnx will be set to use batchsize of 64. And when using onnx2trt, just set batchsize to 1.

Topic		Replies	Views
Batch Inference Wrong in Python API TensorRT	15	3558	October 12, 2021
Dynamic batch Tensor-RT inference output is incorrect TensorRT tensorrt , python	2	1329	May 25, 2023
TenorRT with python: execution return zeros if batch_size > 1 TensorRT	1	805	November 20, 2020
A problem of batchsize when convert from onnx to engine file General Topics and Other SDKs tensorrt	1	388	December 6, 2021
TensorRT Batch Inferences : empty outputs TensorRT tensorrt , jetson-inference	8	1944	July 18, 2024
How to support dynamic batch size for TensorRT engine? TensorRT	1	1107	March 3, 2023
Creating a TensorRT Engine with different batch sizes TensorRT python , onnx	12	2811	August 18, 2020
Tensorrt inference on multiple batches TensorRT tensorrt , jetson-inference	5	3147	October 27, 2022
ONNX to TensorRT Python module doesn't generate dynamic batch size engine TensorRT tensorrt , cudnn , onnx	3	1089	October 20, 2023
TensorRT Batch Inference: different results TensorRT	4	4248	December 1, 2021

TensorRT Engine batch inference only has one result

Description

Environment

Related topics