I am trying to calibrate Faster R-CNN to INT8.
Most of the code is from https://devtalk.nvidia.com/default/topic/1015387/tensorrt-fails-to-build-fasterrcnn-gie-model-with-using-int8/
Here is the code for calibration:
CHECK(cudaMalloc(&mImInfoInput, batchSize * 3 * sizeof(float)));
float *imInfo = new float[batchSize * 3];
for (int i = 0; i < batchSize; i++) {
    imInfo[i * 3] = height;        // number of rows
    imInfo[i * 3 + 1] = width;     // number of columns
    imInfo[i * 3 + 2] = scale;     // image scale
}
CHECK(cudaMemcpy(mImInfoInput, imInfo, batchSize * 3 * sizeof(float), cudaMemcpyHostToDevice));
delete[] imInfo;
....
bool getBatch(void *bindings[], const char *names[], int nbBindings) override {
    if (!mStream.next()) return false;
    CHECK(cudaMemcpy(mDataInput, mStream.getBatch(), mInputCount * sizeof(float), cudaMemcpyHostToDevice));
    assert(!strcmp(names[0], "data"));
    assert(!strcmp(names[1], "im_info"));
    bindings[0] = mDataInput;
    bindings[1] = mImInfoInput;
    return true;
}
However, the accuracy drops significantly after calibration. I have a few questions about calibration:
- Do I need to set a large batch size for calibration? How many images should be used? Is there any rule of thumb?
- Here is my config for Faster R-CNN: my input images are 1024 x 512. Before forwarding, I resize each input image to 512 x 256, so I put [1024, 512, 0.5] in im_info, and the rois are then scaled by 0.5 to get the correct size. In the same way, when doing calibration for INT8, every image in a batch is 512 x 256 and I put [1024, 512, 0.5] in im_info. Am I doing this right?
Thanks.