Trtexec and dynamic batch size


I am trying to convert a PyTorch model to TensorRT and then do inference in TensorRT using the Python API.

My model takes two inputs, left_input and right_input, and outputs a cost_volume. I want the batch size to be dynamic and accept either a batch size of 1 or 2.

Can I use trtexec to generate an optimized engine for dynamic input shapes?

My current call:

	trtexec \
		--verbose \
		--explicitBatch \
		--minShapes=left_input:1x3x512x512,right_input:1x3x512x512,cost_volume:1x20x128x128 \
		--optShapes=left_input:2x3x512x512,right_input:2x3x512x512,cost_volume:2x20x128x128 \
		--maxShapes=left_input:2x3x512x512,right_input:2x3x512x512,cost_volume:2x20x128x128 \
		--onnx='my_onnx_model' \
		--saveEngine='my_trt_model'

But I cannot get the inference in Python to work with this engine. It works for a batch size of 1, but with a batch size of 2, only the first sample in the batch is correct.

When I load the engine in Python I get:

In [1]: engine.get_binding_name(0)
Out[1]: u'left_input'

In [2]: engine.get_binding_name(1)
Out[2]: u'right_input'

In [3]: engine.get_binding_name(2)
Out[3]: u'cost_volume'

In [4]: engine.get_binding_shape(0)
Out[4]: (-1, 3, 512, 512)

In [5]: engine.get_binding_shape(1)
Out[5]: (-1, 3, 512, 512)

In [6]: engine.get_binding_shape(2)
Out[6]: (1, 20, 128, 128)

In [7]: execution_context = engine.create_execution_context()

In [8]: execution_context.get_binding_shape(0)
Out[8]: (-1, 3, 512, 512)

In [9]: execution_context.get_binding_shape(1)
Out[9]: (-1, 3, 512, 512)

In [10]: execution_context.get_binding_shape(2)
[TensorRT] ERROR: Parameter check failed at: engine.cpp::resolveSlots::1092, condition: allInputDimensionsSpecified(routine)
Out[10]: (0)

I am wondering why the output binding doesn't get a dynamic batch dimension.

Any guidance here would be appreciated.


TensorRT Version: 7.0.0-1+cuda10.0 amd64
GPU Type: GeForce GTX 1070
Nvidia Driver Version: 440.82
CUDA Version: 10.0
CUDNN Version: amd64
Operating System + Version: Ubuntu 18.04
Python Version (if applicable): 3.6
PyTorch Version (if applicable): 1.4.0

Hi, can you please share the model and script so that we can help better?
Also, if possible, please share the verbose error log.

You need to call execution_context.set_binding_shape on your input bindings to set them to the concrete batch size you are planning to use.

Only after all input shapes have been specified can you call get_binding_shape on the output bindings (or use the context at all). That is what the allInputDimensionsSpecified error is telling you.

Also note that --minShapes/--optShapes/--maxShapes should only list the network inputs (left_input and right_input); the output shape of cost_volume is derived from them by TensorRT, so it does not need to appear there.
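To make this concrete, here is a minimal sketch of the inference setup with the TensorRT 7 Python API. The engine filename, binding indices, and shapes are taken from the question above and should be treated as placeholders for your own setup; buffer allocation and the actual execute call are only indicated in comments.

```python
# Sketch: dynamic-shape inference setup with the TensorRT 7 Python API.
# Assumes the engine built by the trtexec call in the question, where
# binding 0 = left_input, 1 = right_input, 2 = cost_volume.
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

# Deserialize the engine saved by trtexec.
with open("my_trt_model", "rb") as f, trt.Runtime(TRT_LOGGER) as runtime:
    engine = runtime.deserialize_cuda_engine(f.read())

context = engine.create_execution_context()

batch_size = 2  # anything within the [minShapes, maxShapes] range

# Specify a concrete shape for every dynamic input binding FIRST.
context.set_binding_shape(0, (batch_size, 3, 512, 512))  # left_input
context.set_binding_shape(1, (batch_size, 3, 512, 512))  # right_input

# Only now are the output dimensions resolved.
assert context.all_binding_shapes_specified
print(context.get_binding_shape(2))  # cost_volume, batch dim now concrete

# Device buffers must then be allocated to match these resolved shapes
# before calling context.execute_v2(bindings) or execute_async_v2(...).
```

Note that sizing your host/device buffers for the resolved per-call shapes (rather than always for batch size 1) is what makes the second sample in a batch of 2 come out correctly.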