Performance discrepancy between cudnn Convolution Bias Activation Forward and cudnn Convolution Forward

ziheng · April 7, 2020, 5:23am

I have noticed that cudnnConvolutionBiasActivationForward can be much slower than the corresponding cudnnConvolutionForward call for some input sizes. For example, for a depthwise convolution (group count = number of input channels) with 512 input channels, 512 output channels, a 14x14 image and stride 1 on the Jetson Nano, the fused call is more than 50 times slower than the plain convolution call with the same algorithm. This makes the fused op useless in this case. Has anybody else encountered something similar?
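
For a sense of scale, here is a rough sketch of the arithmetic for this shape. The 3x3 kernel, padding of 1 and batch size of 1 are assumptions, since the post does not state them, and the helper names are hypothetical. It only counts multiply-accumulates; it does not call cuDNN or measure anything.

```python
def conv_output_size(size, kernel, stride=1, padding=0, dilation=1):
    """Spatial output size of a convolution along one dimension."""
    return (size + 2 * padding - dilation * (kernel - 1) - 1) // stride + 1


def conv_macs(in_channels, out_channels, groups, height, width,
              kernel, stride=1, padding=0, batch=1):
    """Multiply-accumulate count for a grouped 2D convolution."""
    if in_channels % groups or out_channels % groups:
        raise ValueError("channel counts must be divisible by groups")
    out_h = conv_output_size(height, kernel, stride, padding)
    out_w = conv_output_size(width, kernel, stride, padding)
    # Each output element reads (in_channels / groups) * kernel * kernel inputs.
    macs_per_output = (in_channels // groups) * kernel * kernel
    return batch * out_channels * out_h * out_w * macs_per_output


# Shape from the post; kernel=3, padding=1 and batch=1 are assumed values.
depthwise = conv_macs(512, 512, groups=512, height=14, width=14,
                      kernel=3, padding=1)
dense = conv_macs(512, 512, groups=1, height=14, width=14,
                  kernel=3, padding=1)
print(depthwise, dense, dense // depthwise)
```

Under these assumptions the depthwise layer needs 512 times fewer multiply-accumulates than a dense convolution of the same shape. With so little arithmetic per call, it is worth timing both cuDNN calls over many iterations after a warm-up run before comparing them.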

SunilJB · April 7, 2020, 8:50am

Hi,

Could you please share a repro script so we can help better?
Also, please provide details on the platform you are using:
o Linux distro and version
o GPU type
o NVIDIA driver version
o CUDA version
o cuDNN version
o Python version [if using Python]
o TensorFlow and PyTorch version
o TensorRT version

Thanks
