Batch size > 1 and max workspace

LoveNvidia · May 16, 2020, 12:25am

Hi all,
Q-1 As you know, in the TensorRT 5/6, batch size >1 has a problem, It’s perform image by image instead of all images of batch at same time. I want to know this problem is occurred in UFF parser? Is this problem solved by ONNX parser even in the TensrRT 5/6? I know this problem solved in the TensrRT 7 with profiler.

Q-2, On what basis, we define the value of max workspace in the converting to TensorRT? If the value that we define be smaller than min size, The process of convert will be stop? I want to know, How do we set optimal value for workspace?

SunilJB · May 18, 2020, 4:08am

Q1: In my understanding i don’t think TRT 5/6 has such behavior for batch size > 1.
Only if single batch size is utilizing the complete compute then the processing will be per image.
TRT 7 optimization profiles are to handle dynamic shapes and optimize model for specific input shape range for better performance.
Could you please share the repro script and model file along with system configuration related information so we can help better?

Q2: TensorRT Developer Guide :: NVIDIA Deep Learning SDK Documentation
Section “How do I choose the optimal workspace size?” should cover the explanation.
If workspace size is low TRT conversion will fail or not optimized with better performing algorithm.

Thanks

LoveNvidia · May 18, 2020, 11:38am

Thanks.
1- I used this link. Is it possible please help me to modify this code for using profiler in TRT7.

SunilJB · May 19, 2020, 5:35am

Please refer to below samples:

Thanks

LoveNvidia · May 19, 2020, 11:21am

Thanks

Topic		Replies	Views
TensorRT - max_batch_size issue Jetson TX2	6	1577	October 18, 2021
TensorRT runtime batch processing in C++ TensorRT tensorrt	5	1531	September 8, 2021
TensorRT engine produces incorrect results TensorRT tensorrt , tensorflow , onnx	10	1784	October 29, 2020
Dynamic batch Tensor-RT inference output is incorrect TensorRT tensorrt , python	2	1275	May 25, 2023
TRT inference on batches is not giving any performance benefit Jetson TX2 tensorrt , nvbugs	11	1157	October 18, 2021
TensorRT 5.X / 6.X Batch Size Problem TensorRT	4	606	August 19, 2020
The default value of engine.max_batch_size is 32? TensorRT	4	1748	October 12, 2021
setMaxWorkspaceSize parameter clarification TensorRT	1	2000	February 25, 2021
why tensorrt engine inference speed become much slower if I increase the input image size TensorRT	1	1049	December 18, 2019
TensortRT execute with variable batch size gave incorrect results TensorRT	1	423	November 9, 2021

Batch size > 1 and max workspace

Related topics