I have successfully converted ResNet-r50 to FP16 using TensorRT with both Python and C++, but I am unable to do the same with INT8 precision. I can't quite understand the calibration step required for INT8 acceleration from the official documentation.
Can anyone help me understand the calibration? A good tutorial or some reference links would help.
Thanks in advance.
TensorRT Version: 188.8.131.52
GPU Type: NVIDIA RTX 3080
NVIDIA Driver Version: 460.27.04
CUDA Version: 11.2
Operating System + Version: Linux 18.04
Python Version: 3.6
TensorFlow Version: 2.3.1