I have a TensorRT engine that was converted from an ONNX model with onnx2trt.
When I load this engine and check its max_batch_size, it shows 32.
However, I only want to test a single image, and I cannot change the engine.max_batch_size value. (Even though I already set max_batch_size to 1, the value I print for engine.max_batch_size is different from the max_batch_size I set.)
Because engine.max_batch_size is 32, the wrong buffer sizes get allocated during the allocate_buffers(engine) stage.
In the infer() stage, there is this step:
np.copyto(self.inputs[0].host, img.ravel())
It fails because the sizes do not match:
self.inputs[0].host: 88473600 elements
img.ravel(): 2764800 elements
Since engine.max_batch_size is 32, the host buffer holds 32 * 2764800 = 88473600 elements.
This is where things go wrong for me.
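For reference, my allocate_buffers() follows the buffer-allocation helper from the TensorRT Python samples. A minimal sketch of that pattern (assuming the usual pycuda-based version; HostDeviceMem is the small host/device wrapper from the samples):

import pycuda.driver as cuda
import pycuda.autoinit  # creates a CUDA context
import tensorrt as trt

class HostDeviceMem:
    def __init__(self, host, device):
        self.host = host      # page-locked numpy array
        self.device = device  # device allocation

def allocate_buffers(engine):
    inputs, outputs, bindings = [], [], []
    stream = cuda.Stream()
    for binding in engine:
        # Size is the per-image volume multiplied by engine.max_batch_size,
        # so with max_batch_size 32: 32 * 2764800 = 88473600 elements.
        size = trt.volume(engine.get_binding_shape(binding)) * engine.max_batch_size
        dtype = trt.nptype(engine.get_binding_dtype(binding))
        host_mem = cuda.pagelocked_empty(size, dtype)
        device_mem = cuda.mem_alloc(host_mem.nbytes)
        bindings.append(int(device_mem))
        if engine.binding_is_input(binding):
            inputs.append(HostDeviceMem(host_mem, device_mem))
        else:
            outputs.append(HostDeviceMem(host_mem, device_mem))
    return inputs, outputs, bindings, stream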
See:
def load_engine(trt_runtime, engine_path):
    # Read the serialized engine and deserialize it with the TensorRT runtime.
    with open(engine_path, 'rb') as f:
        engine_data = f.read()
    engine = trt_runtime.deserialize_cuda_engine(engine_data)
    print("Engine.max_batch_size", engine.max_batch_size)
    return engine
Output:
Engine.max_batch_size 32
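As a temporary workaround, it seems I can copy just one image's worth of data into the oversized host buffer and run inference with batch_size=1 (a sketch; self.inputs, self.bindings, self.stream, and self.context are assumed to follow the sample structure above):

import numpy as np

flat = img.ravel()
# Fill only the first batch slot; the remaining 31 slots stay unused.
np.copyto(self.inputs[0].host[:flat.size], flat)
# Ask TensorRT to run only one image, even though the engine allows up to 32.
self.context.execute_async(batch_size=1, bindings=self.bindings,
                           stream_handle=self.stream.handle)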
I have some questions about this:
Why is the default engine.max_batch_size 32?
How do I set engine.max_batch_size? (Not the normal max_batch_size.)
o Linux distro: Ubuntu 18.04
o GPU type: 1060
o Nvidia driver version: 440
o CUDA version: 10.0
o CUDNN version: 7.6.5
o Python version [if using python]: 3.6.9
o Tensorflow and PyTorch version: TF 1.14
o TensorRT version: 7.0.0.11
The default max batch size in onnx2trt is 32. Please refer to the link below:
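For example, to rebuild the engine for single-image inference (file names here are just placeholders):

onnx2trt model.onnx -o model.trt -b 1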
You can either use the -b option to generate an engine with a different max batch size, or use the TensorRT APIs directly to set the max batch size. Please refer to the link below:
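A minimal sketch of the builder-API route (note that builder.max_batch_size only matters for implicit-batch networks; TensorRT 7's ONNX parser builds explicit-batch networks, where the batch size comes from the network's input shape instead):

import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(TRT_LOGGER)
builder.max_batch_size = 1  # must be set before building the engine
# ... parse or define the network into `network`, then:
# engine = builder.build_cuda_engine(network)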