Hello, we planned to use the DINO model instead of the YOLOR model, since its fp16 inference speed is high. We then planned to convert it to int8 mode, but found that int8 is not available for DINO. Will int8 mode for DINO be supported in the future, and if not, what do you suggest to improve the inference speed?
I will sync internally on the feature request for int8. To improve the inference speed, you can select a smaller backbone, for example resnet_50, fan_tiny, or gcvit_xxtiny.
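For reference, the backbone is selected in the DINO experiment spec file. A minimal sketch is below, assuming the standard TAO Toolkit DINO spec layout; the exact field names may differ between TAO versions, so verify against the documentation for your release:

```yaml
# Hypothetical excerpt of a TAO DINO experiment spec.
# Only the backbone line is the change being suggested here;
# the surrounding fields are illustrative placeholders.
model:
  backbone: fan_tiny   # smaller backbone for faster inference
                       # alternatives: resnet_50, gcvit_xxtiny
```

After retraining or fine-tuning with the smaller backbone, the exported ONNX model can still be built into an fp16 TensorRT engine (e.g. via trtexec with the --fp16 flag) to keep the speed benefit you already observed.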
Hi, I have a few doubts. You stated that int8 is not supported for DINO, so what does the int8 listed on the DINO | NVIDIA NGC page refer to? Any assistance regarding int8 mode is highly appreciated.
There has been no update from you for a while, so we assume this is no longer an issue and are closing this topic. If you need further support, please open a new one. Thanks.