How to reproduce the inferencing performance with INT8 on T4 or A2

113736752 · September 4, 2022, 1:18pm

Description

A clear and concise description of the bug or issue.

Environment

TensorRT Version:
GPU Type:
Nvidia Driver Version:
CUDA Version:
CUDNN Version:
Operating System + Version:
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):

Relevant Files

Please attach or include links to any models, data, files, or scripts necessary to reproduce your issue. (Github repo, Google Drive, Dropbox, etc.)

Steps To Reproduce

I want to reproduce the inferencing performance with INT8 on T4 or A2, but I don’t know how to reproduce and compare with the inferencing performance NVIDIA updated monthly in following page, could someone give some instructions, thanks.

NVES · September 4, 2022, 1:37pm

Hi,
Please refer to below links related custom plugin implementation and sample:

While IPluginV2 and IPluginV2Ext interfaces are still supported for backward compatibility with TensorRT 5.1 and 6.0.x respectively, however, we recommend that you write new plugins or refactor existing ones to target the IPluginV2DynamicExt or IPluginV2IOExt interfaces instead.

Thanks!

113736752 · September 4, 2022, 1:50pm

Hi,
Thank you for your quick feedback, I will try it later.

Topic		Replies	Views
Is there any layer that fp16 supports but int8 does not？ TensorRT	5	478	December 1, 2021
TensorRT INT8 inference accuracy TensorRT	2	493	May 9, 2022
Detectron2: faster inferencing TensorRT	2	1380	April 29, 2022
Inference time using TF-TRT is the same as Native Tensorflow for Object Detection Models TensorRT tensorrt , tf-trt	4	1000	March 31, 2022
TensorRT 3: Faster TensorFlow Inference and Volta Support Technical Blog	16	462	December 8, 2020
Acceleration with INT8 precision using TensorRT TensorRT tensorrt , cuda , deep-learning	6	740	February 13, 2021
TensorRT TensorRT tensorrt , python	1	317	October 27, 2021
Could not find any supported formats consistent with input/output data types TensorRT	1	809	April 11, 2023
TensorRT INT8 calibration TensorRT tensorrt , cuda , tensorflow	4	1023	February 15, 2021
How to reproduce nvidia product performance result? General tensorrt	0	634	March 14, 2023

How to reproduce the inferencing performance with INT8 on T4 or A2

Description

Environment

Relevant Files

Steps To Reproduce

Related topics