cuDNN may be slower?

A question about Caffe performance with cuDNN.
Caffe with cuDNN is 1.6x faster than Caffe without cuDNN when the batch size is 64. However, when I set the batch size to 1, Caffe with cuDNN is 2x slower. Does cuDNN have heavy overhead?
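For reference, here is the kind of timing harness I use to get these numbers. It is only a sketch: `forward_fn` stands in for whatever actually runs one forward pass (e.g. a pycaffe `net.forward` call), and the warmup is there so one-time setup costs don't pollute the average.

```python
import time

def avg_forward_time(forward_fn, iters=50, warmup=5):
    """Average wall-clock seconds per call of forward_fn.

    Warmup iterations exclude one-time costs such as memory
    allocation or cuDNN algorithm selection from the measurement.
    """
    for _ in range(warmup):
        forward_fn()
    start = time.perf_counter()
    for _ in range(iters):
        forward_fn()
    return (time.perf_counter() - start) / iters
```

With a real Caffe net this would be called as, for example, `avg_forward_time(lambda: net.forward())`, once with a cuDNN build and once without, at each batch size.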

Any help would be much appreciated!

Do you use convolution layers? If so, your findings align with the existing literature.

Search for the paper “cuDNN: Efficient Primitives for Deep Learning” (Chetlur, Sharan, et al.).

In that paper, Figure 2 gives you a rough idea of the performance of cuDNN convolutions vs. batch size (blue line in the graph = cuDNN, red line = Caffe).



Hi t3l,
Thanks a lot!
Yes, I use convolution layers.
So cuDNN is not well suited for single-image prediction with Caffe.
The paper does not explain the reason. My guess is that cuDNN needs more cudaMemcpy calls to prepare its input data.
I think there should be a dynamic switch so that cuDNN is not invoked when the incoming batch size is too small.
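Until such an automatic switch exists, a manual workaround is possible: Caffe lets you choose the convolution implementation per layer via the `engine` field in the prototxt (assuming a Caffe build compiled with cuDNN support; parameter names below otherwise follow Caffe's layer definition format, and the layer name and sizes are just placeholders):

```
layer {
  name: "conv1"
  type: "Convolution"
  bottom: "data"
  top: "conv1"
  convolution_param {
    num_output: 64
    kernel_size: 3
    # Force the native Caffe GPU path instead of cuDNN,
    # e.g. for a deploy net that runs with batch size 1.
    engine: CAFFE   # alternatives: DEFAULT, CUDNN
  }
}
```

A deploy prototxt for small-batch inference could set `engine: CAFFE` on its convolution layers while the training prototxt keeps cuDNN.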


The upcoming cuDNN v4 has improved performance for batchSize=1 on the Maxwell architecture. Stay tuned.