Runtime Performance Decreased while using int8 - tflite

bhardwajsapna04 · July 20, 2021, 11:29am

Hi,
We are trying to run performance test for ResNet50 tflite based model on various hardwares. While running on Nano, as expected, we gain the runtime performance from 1.16 sec(FP32) to 0.64 sec(INT8).

However, while running the same experiment on Xavier, the performance drops from 0.72 sec(FP32) to 1.08 sec(INT8).

Any guidance/blog to follow for running tflite models on jetson devices. Am I am doing something wrong?

Thanks in advance,
Sapna

AastaLLL · July 21, 2021, 3:06am

Hi,

Nano doesn’t support INT8 inference due to hardware limitations.
Could you double-check if the inference fallback to other precision instead?

First, please make sure you have maximized the device performance as below:

$ sudo nvpmodel -m 0
$ sudo jetson_clocks

For tflite model, you can deploy it with TensorFlow or TensorRT.
TensorRT can give you an optimized performance but need an ONNX format as intermediate.

You can find the benchmark table for Jetson on ResNet50 below.
We can get 824fps (only inference) on XavierNX:

Thanks.

system · September 27, 2021, 1:06pm

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
TensorRT INT8 conversion lack of performance increase. Jetson Nano	2	804	October 15, 2021
Can Jetson nano import int8? Jetson Nano tensorrt	6	1820	October 15, 2021
Unable to inference a trt model in jetson nano/ xavier nx Jetson TX2 tensorrt , jetson-inference	3	1035	March 2, 2022
TensorRT int8 performance Jetson AGX Xavier	4	1327	October 18, 2021
Requesting INT8 data type but platform has no support, ignored DeepStream SDK deepstream	4	678	December 12, 2022
Failed to use INT8 precision mode when using tf-trt on Xavier Jetson AGX Xavier	4	1037	October 18, 2021
Onnx to int8trt issue Jetson Nano tensorrt , ubuntu , python	5	783	October 15, 2021
Running tflite models on Orin Nano Jetson Orin Nano tensorrt	5	230	August 13, 2024
Benchmarck int8 similar to fp32 on yolov8 from ultralytics Jetson Orin Nano tensorrt , yolo	6	1605	December 18, 2023
Inference Speed Jetson Xavier NX pytorch	6	1006	April 12, 2023

Runtime Performance Decreased while using int8 - tflite

Related topics