Convolutions in cuDNN

wanderine · July 6, 2018, 5:38pm

How are the 2D and 3D convolutions implemented in cuDNN? If I understand the documentation correctly, a dense matrix-vector multiplication is used, which seems very inefficient. Why not use sparse matrix-vector multiplications instead, since 3 x 3 filters generate very sparse convolution matrices? Also, is there a comparison between different ways of performing the convolutions, for example against an implementation that does not go through a matrix multiplication but instead has each GPU thread compute the convolution for one pixel?

KingDudman · July 8, 2018, 7:32pm

In the docs, the forward pass alone has several options, enumerated by the type cudnnConvolutionFwdAlgo_t. Those are the different algorithms that can be used for the forward pass. If you call cudnnFindConvolutionForwardAlgorithm(), it fills an array of cudnnConvolutionFwdAlgoPerf_t results sorted in ascending order of execution time. Having several algorithms matters partly because of workspace size: they differ in how much workspace memory they need, and each result reports that requirement alongside the time.
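To make the difference between two of those algorithm families concrete, here is a minimal CPU-only sketch that compares a direct per-pixel convolution with a GEMM-style one built on an im2col patch matrix. It is an illustration under stated assumptions, not how cuDNN implements either path internally: single channel, stride 1, no padding, no filter flip. The names `Tensor2D`, `conv2d_direct`, `im2col` and `conv2d_gemm` are hypothetical helpers made up for this example and are not part of cuDNN.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper type for this sketch (not a cuDNN type):
// a single-channel, row-major 2D array.
struct Tensor2D {
    std::size_t rows;
    std::size_t cols;
    std::vector<float> data;
    float at(std::size_t r, std::size_t c) const { return data[r * cols + c]; }
};

// Direct approach: every output pixel is an independent sum over the filter
// window. On a GPU, one thread could own one (y, x) output position.
// Assumes stride 1, no padding, no filter flip (cross-correlation).
Tensor2D conv2d_direct(const Tensor2D& img, const Tensor2D& flt) {
    assert(img.rows >= flt.rows && img.cols >= flt.cols);
    const std::size_t out_rows = img.rows - flt.rows + 1;
    const std::size_t out_cols = img.cols - flt.cols + 1;
    Tensor2D out{out_rows, out_cols,
                 std::vector<float>(out_rows * out_cols, 0.0f)};
    for (std::size_t y = 0; y < out_rows; ++y) {
        for (std::size_t x = 0; x < out_cols; ++x) {
            float acc = 0.0f;
            for (std::size_t i = 0; i < flt.rows; ++i)
                for (std::size_t j = 0; j < flt.cols; ++j)
                    acc += img.at(y + i, x + j) * flt.at(i, j);
            out.data[y * out_cols + x] = acc;
        }
    }
    return out;
}

// im2col: copy each filter-sized window into one row of a dense matrix of
// shape (out_rows * out_cols) x (flt_rows * flt_cols).
Tensor2D im2col(const Tensor2D& img, std::size_t flt_rows, std::size_t flt_cols) {
    assert(img.rows >= flt_rows && img.cols >= flt_cols);
    const std::size_t out_rows = img.rows - flt_rows + 1;
    const std::size_t out_cols = img.cols - flt_cols + 1;
    Tensor2D patches{out_rows * out_cols, flt_rows * flt_cols, {}};
    patches.data.reserve(patches.rows * patches.cols);
    for (std::size_t y = 0; y < out_rows; ++y)
        for (std::size_t x = 0; x < out_cols; ++x)
            for (std::size_t i = 0; i < flt_rows; ++i)
                for (std::size_t j = 0; j < flt_cols; ++j)
                    patches.data.push_back(img.at(y + i, x + j));
    return patches;
}

// GEMM-style approach: the convolution becomes a dense product of the patch
// matrix with the flattened filter. With one filter this is a matrix-vector
// product; with K filters it would be a matrix-matrix product.
Tensor2D conv2d_gemm(const Tensor2D& img, const Tensor2D& flt) {
    const Tensor2D patches = im2col(img, flt.rows, flt.cols);
    Tensor2D out{img.rows - flt.rows + 1, img.cols - flt.cols + 1,
                 std::vector<float>(patches.rows, 0.0f)};
    for (std::size_t p = 0; p < patches.rows; ++p) {
        float acc = 0.0f;
        for (std::size_t k = 0; k < patches.cols; ++k)
            acc += patches.at(p, k) * flt.data[k];
        out.data[p] = acc;
    }
    return out;
}
```

Calling `conv2d_direct(img, flt)` and `conv2d_gemm(img, flt)` on the same inputs gives identical outputs. Note that the patch matrix is dense and stores only the pixels under each window, so the GEMM route pays for extra memory (one reason workspace size enters the picture) rather than multiplying by a mostly-zero convolution matrix.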
