Description
I trained a YOLOv4 model and converted it along the path Darknet → ONNX → TensorRT.
My device is a Jetson Xavier NX.
I use the nvcr.io/nvidia/l4t-tensorflow:r32.4.3-tf2.2-py3 container.
Issue
If I use batch_size = 1, everything works fine.
If I use batch_size > 1, predictions come back only for the first image in the batch; the outputs for the remaining images are zero-padded.
Environment
TensorRT Version: 7.1.3.0
GPU Type: JETSON XAVIER NX
Nvidia Driver Version:
CUDA Version: 10.2
CUDNN Version: 8
Operating System + Version: Linux
Python Version (if applicable): 3.6.9
TensorFlow Version (if applicable): 2.2
PyTorch Version (if applicable): -
Baremetal or Container (if container which image + tag): nvcr.io/nvidia/l4t-tensorflow:r32.4.3-tf2.2-py3
Steps To Reproduce
- I trained a YOLOv4 model with Darknet (cfg and weights files).
- I used the pytorch-YOLOv4 repository for the Darknet → ONNX conversion:
python demo_darknet2onnx.py yolov4-tiny-3l.cfg yolov4-tiny-3l_final.weights img.jpg 16
The resulting model: yolov4_16_3_608_608_static.onnx
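As a sanity check, the batch size is baked into the exported graph and encoded in the filename (the `yolov4_{batch}_{channels}_{height}_{width}_static.onnx` pattern is inferred here from the export command and the resulting name, not from repository documentation):

```python
# Sketch: read the static shape back out of the exported ONNX filename.
# The {batch}_{channels}_{height}_{width} pattern is an assumption based
# on the export command (16, 608x608 cfg) and the output name above.
name = "yolov4_16_3_608_608_static.onnx"

stem = name.rsplit(".", 1)[0]           # drop ".onnx"
parts = stem.split("_")                 # ['yolov4','16','3','608','608','static']
batch, channels, height, width = (int(p) for p in parts[1:5])

print(batch, channels, height, width)   # 16 3 608 608
```

So the ONNX file itself expects a full 16-image input tensor; it is not a batch-1 model replicated at runtime.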
- Then I used trtexec to build the engine and run inference from my code. Everything works fine, but only when batch_size = 1. With batch_size = 16 I have problems; I tried the following options:
- with the --batch=16 parameter:
trtexec --onnx=yolov4_16_3_608_608_static.onnx --saveEngine=engine_b16_fp32.engine --workspace=4096 --buildOnly --batch=16
The resulting model: engine_b16_fp32.engine
Outputs from inference:
[[(0, 0.9952153, 877, 848, 113, 58), (0, 0.99335945, 1199, 171, 103, 80), (0, 0.99015826, 123, 147, 142, 54), (0, 0.98760045, 1483, 315, 73, 56), (0, 0.98437995, 134, 511, 74, 54)], [], [], [], [], [], [], [], [], [], [], [], [], [], [], []]
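To make the failure mode concrete: the output above is what you get when only the first image's region of the flat output buffer is written and the rest stays zero. A minimal pure-Python sketch of that layout (the 6-value box tuple mirrors the output above; `boxes_per_image` is a made-up small number for illustration, not the model's real box count):

```python
# Sketch: how a flat per-batch output buffer maps to per-image detections.
# Assumed layout: batch_size * boxes_per_image boxes stored back to back,
# 6 floats per box (class_id, confidence, x, y, w, h).

batch_size = 16
boxes_per_image = 2
floats_per_box = 6

# Simulate an engine that only filled in the first image's slice:
flat = [0.0] * (batch_size * boxes_per_image * floats_per_box)
flat[0:6] = [0, 0.9952153, 877, 848, 113, 58]   # first box, image 0

per_image = []
for b in range(batch_size):
    start = b * boxes_per_image * floats_per_box
    boxes = []
    for i in range(boxes_per_image):
        s = start + i * floats_per_box
        box = flat[s:s + floats_per_box]
        if box[1] > 0:          # keep only boxes with nonzero confidence
            boxes.append(tuple(box))
    per_image.append(boxes)

print(per_image)   # image 0 has a detection, images 1..15 are all []
```

The post-processing itself is fine; it is the enqueue that only processed one image's worth of data.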
- with the --explicitBatch option:
trtexec --onnx=yolov4_16_3_608_608_static.onnx --saveEngine=engine_uavvaste_yolov4_tiny_3l_608_b16_fp32.engine --workspace=4096 --buildOnly --explicitBatch
Outputs from inference:
[TensorRT] ERROR: Parameter check failed at: engine.cpp::enqueue::387, condition: batchSize > 0 && batchSize <= mEngine.getMaxBatchSize(). Note: Batch size was: 16, but engine max batch size was: 1
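This error is consistent with the engine being built in explicit-batch mode (the ONNX parser in TensorRT 7 always uses explicit batch, so the `--batch=16` flag is ignored and `engine.max_batch_size` reports 1). With explicit batch, the batch dimension lives in the tensor shapes themselves, and the implicit-batch `context.execute(batch_size=16, ...)` call is rejected; inference should go through `execute_v2`, which takes no batch-size argument. A hedged sketch of the call site, assuming a pycuda-style setup; the helper name and its arguments are mine, not from the repository:

```python
# Sketch: inference with an explicit-batch TensorRT 7 engine.
# There is no batch_size argument; the batch dimension (16 here) is part
# of the binding shapes, so every buffer must be allocated for the whole
# (16, ...) tensor, not for a single image.
try:
    import tensorrt as trt  # only available on the Jetson / in the L4T container
except ImportError:
    trt = None  # allows the sketch to be read off-device

def infer_batch(context, bindings):
    """Run one explicit-batch inference.

    context  -- trt.IExecutionContext created from the deserialized engine
    bindings -- list of device pointers (ints), one per engine binding,
                each allocated for the full 16-image tensor shape
    """
    # execute_v2 replaces execute(batch_size, bindings) for explicit batch.
    return context.execute_v2(bindings=bindings)
```

If the ONNX file had been exported with a dynamic batch dimension instead of the static 16, building the engine would additionally require an optimization profile (e.g. trtexec's shape flags), and the code would need a `context.set_binding_shape(...)` call before `execute_v2`.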
