Could you please give some examples one the INT8 optimization, including calibration and so on?
you know, the documents here [Developer Guide :: NVIDIA Deep Learning TensorRT Documentation] is too short for the details! What’s the ImageBatchStream? How to import it? How to construct calibration files? When I search the Internet, many examples shows this:
import tensorrt as trt
Int8_calibrator = trt.infer.EntropyCalibrator()...
The question is, in the version 5.0.2 of tensorrt, it does not have the attribute of infer or module named tensorrt.infer, OK?