I have successfully converted ResNet-r50 to FP16 using TensorRT with both Python and C++, but I am unable to do the same with INT8 precision. I can't quite understand the calibration step that INT8 conversion involves from the official documentation alone.
Can anyone help me understand the calibration process? A good tutorial or some reference links would help.
Thanks in advance.
Environment
TensorRT Version: 7.2.2.1
GPU Type: NVIDIA RTX 3080
NVIDIA Driver Version: 460.27.04
CUDA Version: 11.2
Operating System + Version: Linux 18.04
Python Version: 3.6
TensorFlow Version: 2.3.1
I have referred to these links, but I am still having difficulty converting my custom model. If you know of any good tutorials, they would be very helpful.
I am new to the field, so sorry if these queries are basic or obvious.