I am working on converting an ONNX model to TensorRT. I successfully converted the ONNX model to TRT in FP16 and it runs on DeepStream. However, when I tried to convert the ONNX model in INT8 precision mode, it only generated the engine file without a calibration cache. I have already implemented the INT8 calibrator with a write_calibration_cache function. Can anyone tell me how to save the calibration cache? Thanks.
Please check the sample link to validate whether any step is missing.
@AakankshaS Thanks. I replaced build_engine() with build_cuda_engine(), although I do not know why build_engine() does not enter calibration while build_cuda_engine() does. With build_cuda_engine(), the conversion enters the INT8 calibration pipeline, but the following error occurs.
[TensorRT] VERBOSE: Total Host Persistent Memory: 306992
[TensorRT] VERBOSE: Total Device Persistent Memory: 265933824
[TensorRT] VERBOSE: Total Weight Memory: 0
[TensorRT] VERBOSE: Builder timing cache: created 141 entries, 1340 hit(s)
[TensorRT] VERBOSE: Engine generation completed in 94.061 seconds.
[TensorRT] VERBOSE: Calculating Maxima
[TensorRT] INFO: Starting Calibration.
2020-08-04 11:43:37 - ImageCalibrator - INFO - Calibration images pre-processed: 8/200
(8, 3, 608, 608)
Traceback (most recent call last):
File "createEngine.py", line 180, in
File "createEngine.py", line 167, in main
File "createEngine.py", line 124, in build_engine
engine = builder.build_cuda_engine(network)
RuntimeError: Unable to cast Python instance to C++ type (compile in debug mode for details)
Process finished with exit code 1
I have correctly configured build_cuda_engine() by setting the IBuilderConfig after checking the TensorRT Python documentation, but now I encounter the same error as with build_engine().
@AakankshaS I just solved the problem by modifying the return value of get_batch() to be a list, i.e. returning [output], and by using build_engine() with the IBuilderConfig set. It is running now. Thanks.