I have exported my PyTorch model with the following call:
torch.onnx.export(
    model, dummy_input, outfile, verbose=verbose,
    input_names=['input_batch'], output_names=output_names,
    # keep_initializers_as_inputs=True,
    opset_version=11,
    do_constant_folding=True,
    dynamic_axes={
        'input_batch': {0: 'dynamic'},
        'cif': {0: 'dynamic'},
        'caf': {0: 'dynamic'},
    },
)
The input shape is 1x3x369x641 (NCHW), and the saved model is openpifpaf_resnet50_641_369.onnx.
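Before building the engine, I can confirm the export really produced a dynamic batch axis by inspecting the graph inputs (a minimal sketch, assuming the onnx Python package is installed):

import onnx

model = onnx.load('openpifpaf_resnet50_641_369.onnx')
for inp in model.graph.input:
    dims = [d.dim_param or d.dim_value for d in inp.type.tensor_type.shape.dim]
    print(inp.name, dims)  # expect ['dynamic', 3, 369, 641] for input_batch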
Now I want to convert it to a TensorRT engine with a batch size of 32. The input width and height stay fixed; only the batch dimension needs to be dynamic, since I want to run batched inference. Which of the following two commands is correct?
Option 1:
trtexec --onnx=openpifpaf_resnet50_641_369.onnx \
--verbose \
--explicitBatch \
--minShapes=input_batch:1x3x369x641 \
--maxShapes=input_batch:32x3x369x641 \
--optShapes=input_batch:32x3x369x641 \
--saveEngine=openpifpaf-resnet50-dynamic_b32.engine \
--fp16 \
--workspace=16000
Option 2:
trtexec --onnx=openpifpaf_resnet50_641_369.onnx \
--verbose \
--explicitBatch \
--batch=32 \
--saveEngine=openpifpaf-resnet50-dynamic_b32_2.engine \
--fp16 \
--workspace=16000
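My understanding of the dynamic-shape build (Option 1) is that the resulting engine accepts any batch size between minShapes and maxShapes at runtime, roughly like this (a sketch against the TensorRT 7.x Python API, where the call is set_binding_shape; newer releases rename it to set_input_shape, and the .engine file name simply mirrors the --saveEngine flag above):

import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
with open('openpifpaf-resnet50-dynamic_b32.engine', 'rb') as f, \
        trt.Runtime(logger) as runtime:
    engine = runtime.deserialize_cuda_engine(f.read())

context = engine.create_execution_context()
# Any batch from minShapes up to maxShapes should be valid, not just 32:
context.set_binding_shape(0, (8, 3, 369, 641))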
Also, in the trtexec command, what is the difference between --batch and the batch dimension given inside --minShapes/--optShapes/--maxShapes?