Which type/format of data is needed for INT8 calibration

Hi,
Thanks for providing such a powerful tool in TensorRT. To maximize efficiency, we are using the DLA in standalone mode with INT8 as the input/output data type, and we have set the flag that allows all input/output formats (a sketch of this build setup follows the questions below).
But we don't know which type/format of input data we need to prepare for calibration:

1> When all inputs are INT8, will the input data passed to the calibrator be FP32, FP16, or INT8?
2> Will TensorRT reformat the input automatically?
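
For reference, here is a minimal sketch of the build setup described above, using the TensorRT Python API. The enum and attribute names (`EngineCapability.DLA_STANDALONE`, `allowed_formats`, etc.) are from TensorRT 8.x and the network-population step is elided, so treat this as an outline of our configuration rather than a verified recipe:

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
# ... populate `network`, e.g. with an ONNX parser ...

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.INT8)            # INT8 precision
config.default_device_type = trt.DeviceType.DLA  # run on the DLA
config.DLA_core = 0
config.engine_capability = trt.EngineCapability.DLA_STANDALONE  # standalone loadable

# Build a bitmask that allows every tensor format TensorRT defines.
all_formats = 0
for fmt in trt.TensorFormat.__members__.values():
    all_formats |= 1 << int(fmt)

# Request INT8 I/O and allow all formats on every network input/output.
io_tensors = [network.get_input(i) for i in range(network.num_inputs)]
io_tensors += [network.get_output(i) for i in range(network.num_outputs)]
for tensor in io_tensors:
    tensor.dtype = trt.int8
    tensor.allowed_formats = all_formats
```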

Hi, please refer to the links below to perform inference in INT8.

Thanks!

@NVES Thanks for your reply.

Referring to code A and B, it looks like the calibration input is always FLOAT & LINEAR. But is this based on the premise that the network inputs are FLOAT & LINEAR, or does it work for any input type & format?

Yes, the calibration input needs to be float32, regardless of the network's input type/format. Please refer to the following doc for more details.
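
As a concrete illustration, here is a minimal sketch of an INT8 entropy calibrator using the TensorRT Python API, feeding FP32 batches in LINEAR layout. The `batches` iterable and the cache path are hypothetical placeholders; the interface itself (`IInt8EntropyCalibrator2` with `get_batch_size`, `get_batch`, `read_calibration_cache`, and `write_calibration_cache`) is the standard Python calibrator interface:

```python
import numpy as np
import pycuda.autoinit  # noqa: F401  (creates a CUDA context)
import pycuda.driver as cuda
import tensorrt as trt

class FP32Calibrator(trt.IInt8EntropyCalibrator2):
    """Feeds FP32 calibration batches, even when the engine uses INT8 I/O."""

    def __init__(self, batches, cache_path="calib.cache"):
        super().__init__()
        self.batches = iter(batches)  # iterable of FP32 numpy arrays
        self.cache_path = cache_path
        self.device_mem = None

    def get_batch_size(self):
        return 1

    def get_batch(self, names):
        try:
            # Calibration data is always FP32 in a contiguous (LINEAR) layout.
            batch = np.ascontiguousarray(next(self.batches), dtype=np.float32)
        except StopIteration:
            return None  # no more batches: calibration is finished
        if self.device_mem is None:
            self.device_mem = cuda.mem_alloc(batch.nbytes)
        cuda.memcpy_htod(self.device_mem, batch)
        return [int(self.device_mem)]

    def read_calibration_cache(self):
        try:
            with open(self.cache_path, "rb") as f:
                return f.read()
        except FileNotFoundError:
            return None

    def write_calibration_cache(self, cache):
        with open(self.cache_path, "wb") as f:
            f.write(cache)
```

You would attach it to the builder config before building, e.g. `config.int8_calibrator = FP32Calibrator(my_fp32_batches)` (where `my_fp32_batches` is a placeholder for your own data loader).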

Thank you.

Also check out the DLA GitHub page for samples and resources, or to report issues: Recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.

We have a FAQ page that addresses some common questions that we see developers run into: Deep-Learning-Accelerator-SW/FAQ
