INT8 inference results differ between 2080 Ti and Xavier

Description

I used the same INT8 calibration table to build engines on a 2080 Ti and on a Xavier. The 2080 Ti runs TensorRT 8 and the Xavier runs TensorRT 7. I have confirmed that the calibration tables for both versions contain the same key-value pairs (layer name : scale), yet the results on the two hardware platforms differ greatly: the accuracy of the 2080 Ti result is clearly higher than that of the Xavier. How can I solve this problem? Thanks.

Environment

TensorRT Version:
GPU Type:
Nvidia Driver Version:
CUDA Version:
CUDNN Version:
Operating System + Version:
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):

Relevant Files

Please attach or include links to any models, data, files, or scripts necessary to reproduce your issue. (Github repo, Google Drive, Dropbox, etc.)

Steps To Reproduce

Please include:

  • Exact steps/commands to build your repro
  • Exact steps/commands to run your repro
  • Full traceback of errors encountered

Hi,

We recommend using the latest TensorRT version on Xavier to get the best performance; several known issues have been resolved in the latest release.

Thank you.

Hi, please refer to the links below on how to perform inference in INT8.

Thanks!

Thank you for your reply. I have another question: if the INT8 calibration tables produced by different TensorRT versions contain the same key-value pairs (layer name : scale), does that mean the calibration table is compatible across versions and the results should be similar?

Hi,
I am now using TensorRT 8.0 on Xavier and directly setting each layer's dynamic range, but the result is still far from the 2080 Ti's INT8 result. How can I solve this? And could you tell me which issues have been resolved?
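For context, this is roughly how the per-tensor dynamic ranges were applied (a minimal sketch using the TensorRT Python API; the `scales` dict and the scale-to-range factor of 127 are assumptions about how the calibration table is parsed):

```python
import tensorrt as trt

def apply_dynamic_ranges(network, scales):
    """Set a symmetric dynamic range on every tensor listed in `scales`.

    `scales` is a {tensor_name: scale} dict parsed from the calibration
    table; the range is assumed to be +/- scale * 127 (symmetric INT8).
    """
    def set_range(tensor):
        if tensor is not None and tensor.name in scales:
            amax = scales[tensor.name] * 127.0
            tensor.dynamic_range = (-amax, amax)

    # Network inputs also need a dynamic range in INT8 mode.
    for i in range(network.num_inputs):
        set_range(network.get_input(i))

    # Set the range on every layer output tensor as well.
    for i in range(network.num_layers):
        layer = network.get_layer(i)
        for j in range(layer.num_outputs):
            set_range(layer.get_output(j))
```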

Hi,

The calibration results are not guaranteed to be transferable between releases.
However, the most common cause of this kind of discrepancy is that the two engines make different per-layer precision decisions.
We suggest checking this first; a sketch of how to surface those decisions is below.
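A minimal sketch, assuming the TensorRT 8 Python API: build with a verbose logger so the per-layer precision choices appear in the build log, and optionally force strict precision so INT8 layers are not silently promoted.

```python
import tensorrt as trt

# A verbose logger makes the builder print which precision each layer
# actually runs in; diff these logs between the 2080 Ti and the Xavier.
logger = trt.Logger(trt.Logger.VERBOSE)

builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.INT8)

# Optional: ask TensorRT to obey per-layer precision constraints instead of
# falling back to FP16/FP32 where it considers that faster.
config.set_flag(trt.BuilderFlag.STRICT_TYPES)
```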

Thank you.

Thanks. According to my understanding, with the same model parameters and the same scaling factors, the output of the model should be the same. What could cause this error? And what tools are available to inspect the per-layer accuracy error?

Hi,

Even with the same scaling factors, there is no guarantee that the per-layer precisions chosen at build time will be the same. Please verify the layer precisions in the logs.
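Regarding tools for per-layer accuracy comparison: Polygraphy, which ships with recent TensorRT releases, can mark every tensor as an output and compare TensorRT against ONNX Runtime layer by layer. A rough sketch, assuming an ONNX model file named model.onnx (the file name is a placeholder, and any INT8-specific build flags for this setup would still need to be added):

```
# Compare every layer's output between TensorRT and ONNX Runtime
polygraphy run model.onnx --trt --onnxrt \
    --trt-outputs mark all \
    --onnx-outputs mark all
```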

Thank you.