cuDNN dropout backward mistakenly scaling up

andrewrobbins · May 12, 2020, 1:14am

I’m building a framework using cuDNN and I noticed in my testing that cuDNN dropout backward is scaling up its backprop errors based on the dropout rate. I believe that this is a mistake in the implementation, and may cause problems with network learning, since the error rate is incorrectly scaled up during training each time a tensor back propagates through a dropout layer. This scaling up in the backward pass disagrees with the documentation:

(BTW, I understand that the signal is intended to be scaled up during the forward pass so that signal levels remain similar during training and inference; I think that scaling during the backward pass is an issue.)

SunilJB · May 12, 2020, 6:11am

Hi,
Provide details on the platforms you are using:
o Linux distro and version
o GPU type
o Nvidia driver version
o CUDA version
o CUDNN version
o Tensorflow and PyTorch version

If possible, please share the script & model file to reproduce the issue along with error info.

Thanks

andrewrobbins · May 12, 2020, 5:18pm

I’ve created a small testcase that shows the behavior:

to run:
tar -xvf dropoutBackpropTestcase.tar
cd dropoutBackpropTestcase
make

However you may need to adjust the cuda and cudnn paths in the makefile.

andrewrobbins · May 12, 2020, 5:21pm

also I am using a TitanRTX, ubuntu 18.04, cuda 10.2 and the cudnn corresponding to cuda10.2.

andrewrobbins · May 16, 2020, 5:11pm

After thinking about this further, I believe that the behavior of cudnn dropout backward is correct. I think that my confusion was just based on the documentation. Sorry for the confusion.

Topic		Replies	Views
cudnnDropoutBackward doesn't multiply gradient by 1/(1-dropout) GPU-Accelerated Libraries	1	650	April 21, 2016
How to implement a Dropout Layer using cuDNN? cuDNN	3	2594	June 21, 2019
maybe a cudnn bug cuDNN	2	685	April 27, 2018
CUDNN Batchnorm Backward result is not correct? Found big difference than CPU result. cuDNN	2	813	October 24, 2018
MultiHeadAttnBackwardData Wrong Result with postDropout enabled cuDNN	1	901	July 8, 2022
cudnnPoolingBackward bug GPU-Accelerated Libraries	2	1051	December 7, 2014
cudnnConvolutionBackwardFilter crashes the system cuDNN	2	610	September 5, 2018
CUDNN Backpropagation GPU-Accelerated Libraries	0	724	November 15, 2017
Tanh Activation Backward Precision? cuDNN	1	657	April 29, 2020
cuDNN: Need training (backward propagation) sample code GPU-Accelerated Libraries	3	3321	June 28, 2015

cuDNN dropout backward mistakenly scaling up

Related topics