Description
I used to call context->enqueue(batchsize, …) to run inference with an implicit batch dimension.
Now, with TensorRT 7, I have to use code like this:
context->setOptimizationProfile(0); // 0 selects the first profile, 1 the second, etc.
context->setBindingDimensions(0, Dims4(batchsize, 3, 384, 1280)); // 0 is the first input binding; there may be multiple input bindings
context->executeV2(…)
If my actual ‘batchsize’ changes from call to call, will the extra ‘setBindingDimensions’ call become a performance issue?
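For reference, this is the per-inference pattern I mean, as a minimal sketch. It assumes an explicit-batch engine whose input binding 0 was built with a dynamic batch dimension, i.e. shape (-1, 3, 384, 1280), and one optimization profile covering the batch range; `context`, `buffers`, and `batchsize` are placeholders for the surrounding setup code.

```cpp
#include <NvInfer.h>

// Sketch only: runs one synchronous inference at the given batch size.
void infer(nvinfer1::IExecutionContext* context, void** buffers, int batchsize)
{
    // Select the profile; this only needs to change when switching profiles.
    context->setOptimizationProfile(0);

    // Must be set before any execution whose input shape differs from the
    // previous one. Binding index 0 is the (assumed) single dynamic input.
    context->setBindingDimensions(0, nvinfer1::Dims4(batchsize, 3, 384, 1280));

    // Synchronous explicit-batch execution (enqueueV2 is the async variant).
    context->executeV2(buffers);
}
```

So the question is whether calling setBindingDimensions on every iteration of a loop like this adds measurable overhead when batchsize varies.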
Environment
TensorRT Version: 7.2.2
GPU Type: 2070Super
Nvidia Driver Version: 456.71
CUDA Version: 11.0.3
CUDNN Version: 8.0.5
Operating System + Version: Win10