TensorRT INT8 conversion lack of performance increase.

harsha.bommana · October 10, 2019, 6:12am

I currently have a keras model for YoloV3 (GitHub - qqwweee/keras-yolo3: A Keras implementation of YOLOv3 (Tensorflow backend)) and I have extracted the underlying Tensorflow frozen graph. I am also able to successfully convert the frozen graph into tensorRT optimized graphs using the TF-TRT API (TF version is 1.14). I have tried both FP16 and INT8 precision modes but both are giving me same performance (4-5 FPS) on the Jetson Nano. I have also calibrated the INT8 model against some training data. What could be the issue here?

AastaLLL · October 14, 2019, 3:27am

Hi,

Please noticed that not all the platform support INT8 operations.
For Jetson system, only Xavier has Tensor cores hardware and be able to support the INT8 mode.

If you get the similar performance between FP32 and FP16, the layers of your model might be fallbacked into TensorFlow implementation.
You can check this information from the output log first.

Thanks.

Topic		Replies	Views
Inference using FP16 and FP32 precision giving no performance gain on Jetson Nano Jetson Nano	2	1341	October 14, 2021
Runtime Performance Decreased while using int8 - tflite Jetson Xavier NX tensorflow	2	1062	September 27, 2021
Yolov3 int8 on tensorrt 7.1.0.16 Jetson Xavier NX tensorrt	4	850	October 18, 2021
Quantization in TensorRt Jetson Nano tensorrt	6	1889	March 2, 2022
Jetson AGX Xavier INT8 Performance Jetson AGX Xavier	4	1764	October 18, 2021
Issues with Jetson Xavier - TensorRT Jetson AGX Xavier tensorrt	4	352	October 18, 2021
No performance improvement for Tensorflow TensorRT model on converted on Jetsons Xavier NX Jetson Xavier NX tensorrt , tensorflow	2	677	October 18, 2021
Can Jetson nano import int8? Jetson Nano tensorrt	6	1665	October 15, 2021
converting a frozen graph to tensorRT Jetson Nano	5	1788	October 14, 2021
TensorRT Optimization for Tensorflow-Unet-Image-segmentation TensorRT tensorrt , tensorflow , nano	1	1167	August 4, 2021

TensorRT INT8 conversion lack of performance increase.

Related topics