Description
I am trying to convert an FP32 ONNX model to INT8. One technique for conversion is to have a file with the dynamic range of each tensor (used for building the engine). I am trying to find example of capturing the dynamic range as a Python script, but have yet to find an example. I am assuming I run my validation set through the network and save the min/max for each tensor. Could you point me to an example? Thanks!
Environment
TensorRT Version:
GPU Type: AGX Xavier
Nvidia Driver Version:
CUDA Version:
CUDNN Version:
Operating System + Version: Ubuntu 18.04
Python Version (if applicable): 3.6.9
TensorFlow Version (if applicable):
PyTorch Version (if applicable): 1.8.0
Baremetal or Container (if container which image + tag):