Can we do INT8 inference using python API?

I can only find very brief instructions for INT8 inference using the Python API. They say:

  1. import tensorrt as trt
  2. NUM_IMAGES_PER_BATCH = 5
    batchstream = ImageBatchStream(NUM_IMAGES_PER_BATCH, calibration_files)

However, I cannot find the definition of ImageBatchStream in the Python API, so I don't know how to carry out the following steps. I also checked the samples, but could only find INT8 samples written in C++, where the BatchStream class is defined in a header file.

So can we do INT8 inference using the Python API? If we can, how do we build the data pipeline?
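For what it's worth, ImageBatchStream is not part of the TensorRT Python API at all; it is a helper class you write yourself to feed calibration batches. A minimal sketch of such a class (the name, the CHW shape, and the placeholder preprocessing are assumptions, not anything from TensorRT):

```python
import numpy as np

class ImageBatchStream:
    """Yields fixed-size batches of preprocessed images for calibration.

    This is a user-defined helper, not a TensorRT class: the calibrator's
    get_batch() simply asks it for the next batch until it is exhausted.
    """

    def __init__(self, batch_size, calibration_files, shape=(3, 224, 224)):
        self.batch_size = batch_size
        self.files = calibration_files
        self.shape = shape           # CHW shape expected by the network
        self.index = 0

    def reset(self):
        self.index = 0

    def next_batch(self):
        """Return an (N, C, H, W) float32 array, or None when done."""
        if self.index >= len(self.files):
            return None
        chunk = self.files[self.index:self.index + self.batch_size]
        self.index += len(chunk)
        batch = np.stack([self.load(f) for f in chunk])
        return np.ascontiguousarray(batch, dtype=np.float32)

    def load(self, f):
        # Placeholder preprocessing: real code would decode the image file
        # and apply the same resize/normalization used at inference time.
        return np.zeros(self.shape, dtype=np.float32)

# Usage: iterate until the stream is exhausted.
stream = ImageBatchStream(5, ["img%d.jpg" % i for i in range(12)])
shapes = []
while True:
    b = stream.next_batch()
    if b is None:
        break
    shapes.append(b.shape)
print(shapes)  # three batches: 5, 5, and a final partial batch of 2
```

The calibrator's get_batch() callback would call next_batch(), copy the array to device memory, and return the device pointer.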

Hello,

Please see the developer guide for how to set precisions with the Python API: https://docs.nvidia.com/deeplearning/sdk/tensorrt-developer-guide/index.html#enable_int8_python

You can also reference this example which demonstrates how to use TensorRT to improve the inference performance by using INT8 reduced precision.
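Enabling INT8 from Python in the TensorRT 5.x API boils down to a couple of builder attributes. A sketch, assuming you already have a parsed network and a calibrator object (the import is deferred into the function so the snippet parses even without TensorRT installed):

```python
def build_int8_engine(builder, network, calibrator, max_batch_size=8):
    """Build an INT8 engine with the TensorRT 5.x Python API.

    `builder` is a trt.Builder, `network` a parsed INetworkDefinition,
    and `calibrator` an object implementing the IInt8Calibrator methods.
    """
    import tensorrt as trt  # deferred so the sketch parses without TRT

    if not builder.platform_has_fast_int8:
        raise RuntimeError("This GPU has no fast INT8 support")

    builder.max_batch_size = max_batch_size
    builder.int8_mode = True              # request INT8 kernels
    builder.int8_calibrator = calibrator  # supplies calibration batches
    return builder.build_cuda_engine(network)
```

Note that in later TensorRT releases these settings moved to the builder config (`config.set_flag(trt.BuilderFlag.INT8)` and `config.int8_calibrator`).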

regards,
NVIDIA Enterprise Support

Thanks for your reply. I have written a program based on https://devblogs.nvidia.com/int8-inference-autonomous-vehicles-tensorrt/. However, there is still a problem with the definition of Int8Calibrator::write_calibration_cache().
In the example, it accepts a parameter 'ptr' and converts it with int(ptr). However, in 5.0.2.6 this function receives the parameter as a capsule ('data: capsule'), and int(ptr) raises an error.

How can I fix this? I think the problem is caused by the API differences between TensorRT 3 and 5.
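One workaround for the capsule: CPython's `PyCapsule_GetPointer` can unwrap a capsule into a raw pointer, which `ctypes` can then read. This is a sketch of the technique only; whether TensorRT 5.0.2's capsule carries a name (and what that name is) is an assumption you would need to check, so try `None` first and pass the capsule's name if that raises. The demo below wraps a buffer in a capsule itself so it runs stand-alone:

```python
import ctypes

# Declare the CPython capsule API we need.
ctypes.pythonapi.PyCapsule_GetPointer.restype = ctypes.c_void_p
ctypes.pythonapi.PyCapsule_GetPointer.argtypes = [ctypes.py_object,
                                                  ctypes.c_char_p]

def capsule_to_bytes(capsule, size, name=None):
    """Copy `size` bytes out of a PyCapsule wrapping a raw pointer.

    Inside write_calibration_cache(self, ptr, size) you could call
    capsule_to_bytes(ptr, size) and write the result to the cache file
    (assuming the callback still receives a size argument, as the
    TensorRT 3 example did).
    """
    ptr = ctypes.pythonapi.PyCapsule_GetPointer(capsule, name)
    return ctypes.string_at(ptr, size)

# Self-contained demo: wrap a buffer in a capsule ourselves, then unwrap it.
ctypes.pythonapi.PyCapsule_New.restype = ctypes.py_object
ctypes.pythonapi.PyCapsule_New.argtypes = [ctypes.c_void_p, ctypes.c_char_p,
                                           ctypes.c_void_p]
buf = ctypes.create_string_buffer(b"calibration-cache")
cap = ctypes.pythonapi.PyCapsule_New(ctypes.cast(buf, ctypes.c_void_p),
                                     None, None)
print(capsule_to_bytes(cap, 17))
```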

Hi qjfytz,

You can see an example of a more up-to-date INT8 calibration class using TensorRT 6.0 here: https://devtalk.nvidia.com/default/topic/1065026/tensorrt/tensorrt6-dynamic-input-size-does-not-support-int8-with-calibrator-/post/5393304/#5393304
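For anyone landing here later: in TensorRT 5.1+ the `cache` argument supports the buffer protocol, so no capsule or pointer handling is needed. A rough sketch of the shape of such a calibrator (the class is defined inside a factory function so the file parses without TensorRT/pycuda installed; the cache-file name and the batch-stream interface are assumptions):

```python
def make_calibrator(batch_stream, cache_file="calibration.cache"):
    """Return an IInt8EntropyCalibrator2 fed by `batch_stream`.

    `batch_stream` must expose batch_size and next_batch(), which returns
    a contiguous float32 numpy array, or None when exhausted.
    """
    import os
    import tensorrt as trt
    import pycuda.driver as cuda
    import pycuda.autoinit  # noqa: F401  (creates a CUDA context)

    class EntropyCalibrator(trt.IInt8EntropyCalibrator2):
        def __init__(self):
            trt.IInt8EntropyCalibrator2.__init__(self)
            self.stream = batch_stream
            self.d_input = None  # device buffer, sized on first batch

        def get_batch_size(self):
            return self.stream.batch_size

        def get_batch(self, names):
            batch = self.stream.next_batch()
            if batch is None:
                return None  # signals end of calibration data
            if self.d_input is None:
                self.d_input = cuda.mem_alloc(batch.nbytes)
            cuda.memcpy_htod(self.d_input, batch)
            return [int(self.d_input)]

        def read_calibration_cache(self):
            if os.path.exists(cache_file):
                with open(cache_file, "rb") as f:
                    return f.read()
            return None  # no cache yet: run calibration

        def write_calibration_cache(self, cache):
            # `cache` is a buffer object here, so it can be written
            # directly, with no int(ptr) or capsule handling.
            with open(cache_file, "wb") as f:
                f.write(cache)

    return EntropyCalibrator()
```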