Python API - int8_calibrator not used when calling build_engine (but works when calling build_cuda_engine)

dudu.asulin · November 24, 2020, 9:37am

Description

Hi,

I’m trying to convert models from PyTorch → ONNX → TensorRT. Optimally, I would like to use INT8 and support dynamic input size.
I seem to be able to create an INT8 calibrated model if I use builder.build_cuda_engine(network) and use optimization profiles for dynamic input support if I use builder.build_engine(network, config).
The latter option seems to always ignore the int8_calibrator regardless if I set it in the builder or the config objects and even if I remove the dynamic shape optimizations (see code snippet below).

Please let me know if what I’m trying here is not supported or any other way to make this work…

Thanks!

Environment

TensorRT Version:
GPU Type: T4
Nvidia Driver Version:
CUDA Version:
CUDNN Version:
Operating System + Version:
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag): nvcr.io/nvidia/pytorch:20.11-py3

Relevant Files

Steps To Reproduce

def build_engine(onnx_file_path, input_name, int8_calibrator=None,
                 max_batch_size=1, img_size=None, min_size=None, max_size=None):
    # initialize TensorRT engine and parse ONNX model
    with trt.Builder(TRT_LOGGER) as builder, builder.create_builder_config() as config:
        builder = trt.Builder(TRT_LOGGER)
        network_creation_flag = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
        network = builder.create_network(network_creation_flag)
        parser = trt.OnnxParser(network, TRT_LOGGER)

        # parse ONNX
        with open(onnx_file_path, 'rb') as model:
            print('Beginning ONNX file parsing')
            parser.parse(model.read())
        print('Completed parsing of ONNX file')
        # allow TensorRT to use up to 8GB of GPU memory for tactic selection
        config.max_workspace_size = 8 << 30

        # use FP16 mode if possible
        if builder.platform_has_fast_fp16:
            builder.fp16_mode = True
            print('USING FP16!!!')
        if int8_calibrator is not None:
            builder.int8_mode = True
            config.int8_calibrator = int8_calibrator
            builder.int8_calibrator = int8_calibrator
            print('USING INT8!!!', builder.platform_has_fast_int8)

        # # Dynamic input support - commented out for testing (still int8 calibration is not working)
        # if img_size is not None:  # dynamic
        #     opt_min, opt_max = min(img_size), max(img_size)
        #     # landscape profile
        #     profile = builder.create_optimization_profile()
        #     profile.set_shape(input_name, min=(1, 3, min_size, opt_max), opt=(max_batch_size, 3, opt_min, opt_max),
        #                       max=(max_batch_size, 3, opt_max, opt_max))
        #     config.add_optimization_profile(profile)
        #
        #     # portrait profile
        #     profile = builder.create_optimization_profile()
        #     profile.set_shape(input_name, min=(1, 3, opt_max, min_size), opt=(max_batch_size, 3, opt_max, opt_min),
        #                       max=(max_batch_size, 3, opt_max, opt_max))
        #     config.add_optimization_profile(profile)

        # generate TensorRT engine optimized for the target platform
        print('Building an engine...')
        # engine = builder.build_cuda_engine(network)
        engine = builder.build_engine(network, config)
        print("Completed creating Engine")

    return engine

SunilJB · November 24, 2020, 10:48am

Please refer below link:

Thanks

dudu.asulin · November 25, 2020, 10:04am

I found a solution in Int8 calibrate failed while using a new IBuilderConfig · Issue #388 · NVIDIA/TensorRT · GitHub, which is to use config.set_flag(trt.BuilderFlag.INT8) instead of builder.int8_mode = True.
The link to the example script given there is broken, here is the updated link:
tensorrt-utils/onnx_to_tensorrt.py at master · rmccorm4/tensorrt-utils · GitHub

Topic		Replies	Views
Segmentation fault in build_engine when using an int8 calibrator TensorRT	6	1261	October 12, 2021
TensorRT6 Dynamic Input Size does not support int8 with calibrator. TensorRT	13	3457	July 23, 2021
Is there any method to build model with int8 weight in tensorrt? TensorRT	1	1269	July 29, 2021
Tenssorrt INT8 precision engine build failed for the models having custom layer (BatchedNMSDynamic_TRT) TensorRT	11	2001	June 29, 2021
INT8 calibration file not generating, not building in INT8 mode TensorRT tensorrt , ubuntu , python , jetson-nano	15	2518	June 4, 2022
Got Assertion `sI.count() == 1' failed. when create engine with INT8 calibration TensorRT tensorrt	5	614	October 12, 2021
Driver error-TensorRT INT8 deploy TensorRT	3	719	November 20, 2020
How to generate int8 calilb table for trtexec engine generation TensorRT tensorrt	7	4584	October 12, 2021
Deepstream -Jetson Xavier NX - Onnx2trt DeepStream SDK	6	629	October 12, 2021
INT8 Calibration Static Engine Issue TensorRT	1	474	August 13, 2019

Python API - int8_calibrator not used when calling build_engine (but works when calling build_cuda_engine)

Description

Environment

Relevant Files

Steps To Reproduce

Related topics