cuDNN calculates layer sizes differently from Caffe

This means that cuDNN-based implementations cannot load Caffe-based networks as-is:

Caffe code for pooling layers (pooling_layer.cpp):

pooled_height_ = static_cast<int>(ceil(static_cast<float>(
    height_ + 2 * pad_h_ - kernel_h_) / stride_h_)) + 1;
pooled_width_ = static_cast<int>(ceil(static_cast<float>(
    width_ + 2 * pad_w_ - kernel_w_) / stride_w_)) + 1;

Note the ceil. cuDNN does not use ceil; empirically, it rounds down instead.
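As a worked example (hypothetical sizes): with a 6-wide input, a 3-wide kernel, stride 2, and no padding, Caffe computes ceil((6 + 0 - 3) / 2) + 1 = ceil(1.5) + 1 = 3, while the floor convention gives (6 + 0 - 3) / 2 + 1 = 1 + 1 = 2. The two disagree whenever (input + 2 * pad - kernel) is not a multiple of the stride.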

I assume that you mean cudnnGetPoolingNdForwardOutputDim does not exactly follow what Caffe expects.

In any case, the actual pooling routine cudnnPoolingForward will respect the dimensions of the output tensor descriptor provided. In other words, if the output tensor provided is a bit smaller than what cudnnGetPoolingNdForwardOutputDim would have advised, cudnnPoolingForward will not write out of the bounds of the provided output descriptor, and thus should produce results identical to Caffe.
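A minimal sketch of that usage (cuDNN v4+ signatures assumed; the 1x1x6x6 input, 3x3/stride-2 pooling, and the Caffe-sized 3x3 output are hypothetical, and error checking is reduced to printing the final status):

#include <cstdio>
#include <cuda_runtime.h>
#include <cudnn.h>

int main() {
  cudnnHandle_t handle;
  cudnnCreate(&handle);

  cudnnTensorDescriptor_t xDesc, yDesc;
  cudnnPoolingDescriptor_t poolDesc;
  cudnnCreateTensorDescriptor(&xDesc);
  cudnnCreateTensorDescriptor(&yDesc);
  cudnnCreatePoolingDescriptor(&poolDesc);

  // Hypothetical shape: 1x1x6x6 input, 3x3 max pooling, stride 2, no padding.
  cudnnSetTensor4dDescriptor(xDesc, CUDNN_TENSOR_NCHW, CUDNN_DATA_FLOAT,
                             1, 1, 6, 6);
  cudnnSetPooling2dDescriptor(poolDesc, CUDNN_POOLING_MAX,
                              CUDNN_NOT_PROPAGATE_NAN,
                              3, 3, 0, 0, 2, 2);

  // Caffe's ceil formula gives a 3x3 output here; cuDNN's own suggestion
  // (cudnnGetPooling2dForwardOutputDim, floor) would be 2x2. Bind the
  // Caffe-sized shape instead.
  cudnnSetTensor4dDescriptor(yDesc, CUDNN_TENSOR_NCHW, CUDNN_DATA_FLOAT,
                             1, 1, 3, 3);

  float *x = nullptr, *y = nullptr;
  cudaMalloc(reinterpret_cast<void**>(&x), 6 * 6 * sizeof(float));
  cudaMalloc(reinterpret_cast<void**>(&y), 3 * 3 * sizeof(float));
  cudaMemset(x, 0, 6 * 6 * sizeof(float));

  float alpha = 1.0f, beta = 0.0f;
  // Per the reply above, the kernel writes only the extent described
  // by yDesc; the status reports whether this shape was accepted.
  cudnnStatus_t s = cudnnPoolingForward(handle, poolDesc, &alpha,
                                        xDesc, x, &beta, yDesc, y);
  std::printf("pooling status: %s\n", cudnnGetErrorString(s));

  cudaFree(x);
  cudaFree(y);
  cudnnDestroyPoolingDescriptor(poolDesc);
  cudnnDestroyTensorDescriptor(xDesc);
  cudnnDestroyTensorDescriptor(yDesc);
  cudnnDestroy(handle);
  return 0;
}

If a given cuDNN version instead validates the output shape, the returned status should surface that, which is still preferable to silent garbage.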

OK cool, good to know, yes.

I have found a situation where the convolution layer does not work that way; meaning if I provide an output tensor that is different from what cuDNN recommends for the convolution layer, I can get undefined results (reading into memory past the input buffer). Is this interesting, or is it just user error?

For pooling, we have verified explicitly that #2 is correct.

However, you are right that for convolution there are some cases where, if you provide an output tensor different from what cuDNN recommends, you can get undefined results. We are in the process of fixing them.

If you can provide your use case (convolution descriptor config, input/output tensor descriptors), we will make sure that it works.
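One way to surface such a mismatch early, rather than debugging undefined results, is to compare the intended output shape against what cuDNN computes before binding the output tensor. A sketch (cuDNN 6+ signatures assumed; all sizes hypothetical, not the poster's actual configuration):

#include <cstdio>
#include <cudnn.h>

int main() {
  // Hypothetical configuration: 1x3x6x6 input, eight 3x3x3 filters,
  // stride 2, no padding, no dilation.
  cudnnTensorDescriptor_t xDesc;
  cudnnFilterDescriptor_t wDesc;
  cudnnConvolutionDescriptor_t convDesc;
  cudnnCreateTensorDescriptor(&xDesc);
  cudnnCreateFilterDescriptor(&wDesc);
  cudnnCreateConvolutionDescriptor(&convDesc);

  cudnnSetTensor4dDescriptor(xDesc, CUDNN_TENSOR_NCHW, CUDNN_DATA_FLOAT,
                             1, 3, 6, 6);
  cudnnSetFilter4dDescriptor(wDesc, CUDNN_DATA_FLOAT, CUDNN_TENSOR_NCHW,
                             8, 3, 3, 3);
  cudnnSetConvolution2dDescriptor(convDesc, 0, 0, 2, 2, 1, 1,
                                  CUDNN_CROSS_CORRELATION, CUDNN_DATA_FLOAT);

  // Ask cuDNN for the output shape it expects and compare it with the
  // shape the network definition calls for before binding an output
  // tensor; a mismatch here is what leads to the undefined results.
  int n, c, h, w;
  cudnnGetConvolution2dForwardOutputDim(convDesc, xDesc, wDesc,
                                        &n, &c, &h, &w);
  std::printf("cuDNN expects %d x %d x %d x %d\n", n, c, h, w);  // 1 x 8 x 2 x 2

  cudnnDestroyConvolutionDescriptor(convDesc);
  cudnnDestroyFilterDescriptor(wDesc);
  cudnnDestroyTensorDescriptor(xDesc);
  return 0;
}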

My use case does work, honestly; I was producing incorrect layer sizes and getting NaNs in the output. I then tracked it down and studied Caffe, which seems to be something of a gold standard, and that is where I saw that it uses two different ways of calculating the output width depending on whether the layer is pooling or convolution.
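For reference, the two conventions paraphrased as a sketch (the pooling formula is the one quoted above from pooling_layer.cpp; the convolution formula uses Caffe's plain integer division, which rounds down):

#include <cmath>

// Pooling output size: Caffe rounds up so no input pixels are dropped
// (the pooling_layer.cpp expression quoted above).
int pooled_dim(int in, int pad, int kernel, int stride) {
  return static_cast<int>(
      std::ceil(static_cast<float>(in + 2 * pad - kernel) / stride)) + 1;
}

// Convolution output size: plain integer division rounds down, which
// matches what cuDNN computes.
int conv_dim(int in, int pad, int kernel, int stride) {
  return (in + 2 * pad - kernel) / stride + 1;
}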

Once I implemented this, everything worked just fine with cuDNN. I think for my use case an error would have been better, as my bug was just a coding mistake. Tracking down the NaNs can really take time, but it was arguably more useful in the long run to have had to do it ;).

Since Caffe uses ceiling and cuDNN doesn't, the described scenario is irrelevant, as the tensor suggested by cudnnGetPoolingNdForwardOutputDim will always be equal to or smaller than one sized to match Caffe. I.e., the question is whether cuDNN pooling will respect a slightly too large output tensor and not fill the border with garbage.