I want to convert an MXNet model to TensorRT through the caffe parser (MXNet -> Caffe -> TensorRT). The issue is with Caffe's padding convention.
Assume the input is 28x28 (HxW), the pooling kernel is 3x3, the stride is 2x2, and the padding is 0. Caffe computes a pooling output size of 14x14, while MXNet computes 13x13.
I noticed the API nvinfer1::INetworkDefinition::setPoolingOutputDimensionsFormula(IOutputDimensionsFormula *formula); the default formula in each dimension is (inputDim + padding * 2 - kernelSize) / stride + 1. Following this convention, the pooling output size should be 13x13, not 14x14.
How can I ensure that pooling layer follows the default convention even if caffe parser is used?
Yes, so the question is: how can I choose between the two padding conventions in TensorRT?
Specifically, could you please give an example of the usage of nvinfer1::INetworkDefinition::setPoolingOutputDimensionsFormula(IOutputDimensionsFormula *formula)?
Sorry, I do not know that. But if you trained the model in MXNet, you can retrain it with the 'full' pooling convention and then port it to Caffe.
We don’t have a sample demonstrating the setPoolingOutputDimensionsFormula() API, but you can check this document for more information: /usr/share/doc/tensorrt/html/classnvinfer1_1_1_i_output_dimensions_formula.html