TensorRT3 YOLOv2 int8 calibration

moodie · March 21, 2018, 5:01pm

Hello,

I have a YOLOv2 model deployed successfully with TensorRT 3, using TensorRT for everything up to the final 1x1 convolutional prediction layer. I’d like to convert the early stages to int8 precision. When I run the conversion process, I get this error:
NvPluginYOLO.cu:58: virtual void nvinfer1::plugin::PReLU::configure(const nvinfer1::Dims*, int, const nvinfer1::Dims*, int, int): Assertion `mBatchDim == 1' failed.
Initially I thought this error came from an internal renaming of maxBatchSize, but I’m now unsure what is actually causing it.
To be clear, the YOLOv2 conversion code works correctly without int8 calibration.

This happens whether nvinfer1::IInt8EntropyCalibrator::getBatchSize() returns a batch size of 5 or 1.

My max batch size is set to 1 via IBuilder::setMaxBatchSize().

I was under the impression that, since int8 is not supported by plugin layers, the data is converted from int8 to fp32 and back around each plugin layer. The build log confirms this:
Adding reformat layer: conv2 reformatted input 0 ((Unnamed ITensor* 4)) from Float(1,480,138240,4423680) to Int8(1,480,138240:4,1105920)
Adding reformat layer: relu_conv2 reformatted input 0 (bn_conv2) from Int8(1,480,138240:4,2211840) to Float(1,480,138240,8847360)
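
One way to read the numbers in these log lines, as a sketch rather than a statement of TensorRT internals: they look like element strides listed from the innermost dimension outward, with ":4" marking channels packed in groups of four. Assuming an NCHW tensor of 32 channels at 288 x 480 (values inferred from the strides, not stated in the log), the helper below reproduces the figures. packedStrides is a hypothetical name and not part of TensorRT.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical helper, not a TensorRT API.
// Computes element strides for a C x H x W tensor whose channels are packed
// in groups of `vec` (vec = 1 means plain, unpacked layout).
// Order matches the log: {W stride, H stride, channel-group stride, batch stride}.
std::array<std::int64_t, 4> packedStrides(std::int64_t c, std::int64_t h,
                                          std::int64_t w, std::int64_t vec) {
    assert(c > 0 && h > 0 && w > 0 && vec > 0);
    const std::int64_t groups = (c + vec - 1) / vec;  // ceil(c / vec)
    return {1, w, h * w, groups * h * w};
}
```

With these assumptions, packedStrides(32, 288, 480, 1) gives the Float strides on the first line and packedStrides(32, 288, 480, 4) gives the Int8 ones; the second line matches the same spatial size with 64 channels.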

SiddharthSharma_TPM · April 26, 2018, 11:45pm

We created a new “Deep Learning Training and Inference” section in Devtalk to improve the experience for deep learning and accelerated computing, and HPC users:
https://devtalk.nvidia.com/default/board/301/deep-learning-training-and-inference-/

We are moving active deep learning threads to the new section.

URLs for topics will not change with the re-categorization. So your bookmarks and links will continue to work as earlier.

-Siddharth

373197201 · April 28, 2018, 5:59am

Hi Moodie:

How did you convert the early stages to int8 precision?

Thanks
Bryan

moodie · May 9, 2018, 5:35pm

Hello,

I made a calibrator and then called the following:
auto calibrator = std::make_unique(calibration_path, calibration_data, m_img_size);
builder->setInt8Calibrator(calibrator.get());
builder->setInt8Mode(true);
prior to
engine = builder->buildCudaEngine(*network);
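
For readers wondering what the calibrator itself has to do: during the build, TensorRT repeatedly asks it for the next batch of calibration data, and the calibrator signals the end of the data set by returning false from getBatch. The host-side bookkeeping for that can be sketched as below. This is only an illustration, not the calibrator used above: CalibrationBatchFeeder and its members are hypothetical names, and a real calibrator would additionally copy each batch to device memory and store that device pointer in the bindings array, which is omitted here because it needs the CUDA and TensorRT headers.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical host-side helper, not a TensorRT class.
// Hands out consecutive full batches from a flat buffer of preprocessed images.
class CalibrationBatchFeeder {
public:
    CalibrationBatchFeeder(std::vector<float> data, std::size_t imageVolume, int batchSize)
        : mData(std::move(data)), mImageVolume(imageVolume), mBatchSize(batchSize) {
        assert(imageVolume > 0 && batchSize > 0);
    }

    // The value a calibrator would report from getBatchSize().
    int getBatchSize() const { return mBatchSize; }

    // Returns a pointer to the next full batch, or nullptr once fewer than
    // batchSize images remain (where getBatch would return false).
    const float* next() {
        const std::size_t batchVolume = mImageVolume * static_cast<std::size_t>(mBatchSize);
        if (mOffset + batchVolume > mData.size()) {
            return nullptr;
        }
        const float* batch = mData.data() + mOffset;
        mOffset += batchVolume;
        return batch;
    }

private:
    std::vector<float> mData;   // all calibration images, back to back
    std::size_t mImageVolume;   // elements per image (C * H * W)
    int mBatchSize;             // images per calibration batch
    std::size_t mOffset = 0;    // start of the next batch, in elements
};
```

A trailing partial batch is dropped in this sketch, so the number of calibration images is best chosen as a multiple of the batch size.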

skabhilash10 · July 18, 2019, 8:14am

Hi Moodie,

Could you please share your YOLOv2 implementation using TensorRT? I have not been able to get the implementation done.
