Cudnn8.0.4 Convolution Occupy High Memory

840302039 · November 27, 2020, 10:21am

After upgrade from my caffe from cudnn7.5(cuda10) to cudnn8.0.4(cuda11), the whole training and infer gpu memory become more higher, and the only change is that i fix the cudnnConvolutionFwdAlgo_t to CUDNN_CONVOLUTION_FWD_ALGO_IMPLICIT_GEMM to avoid alloc workspace.
This is the source file that I only change, please give some suggestion where is wrong?Thanks.
sourcecode.zip (10.0 KB)

AakankshaS · December 2, 2020, 6:33pm

Hi @840302039,
CUDNN_CONVOLUTION_FWD_ALGO_IMPLICIT_GEMM does not require any workspace.
For training there are also backward paths. If you only changed forward algo to algo0, then it is likely the backward paths require a lot more workspace than 7.5
There are also memory needed for auto tuning.

Thanks!

AakankshaS · December 4, 2020, 3:22am

Hi,
You can isolate the problem by checking the API logs. There will be workspace sizes required for each operation in the API logs, something like
workSpaceSizeInBytes: type=size_t; val=1288;
For how to generate API logs, see Developer Guide :: NVIDIA Deep Learning cuDNN Documentation

Thanks!

840302039 · December 10, 2020, 8:01am

Thanks for your suggestion. I turn on the API logs, and logs tells that, all workspaceSizeInBytes are 0. By the way, I have set three algo as below.
fwd_algo_[i].algo = CUDNN_CONVOLUTION_FWD_ALGO_IMPLICIT_GEMM;
bwd_filter_algo_[i].algo = CUDNN_CONVOLUTION_BWD_FILTER_ALGO_0;
bwd_data_algo_[i].algo = CUDNN_CONVOLUTION_BWD_DATA_ALGO_0;

So I think maybe CUDNN 8 occupy memory more than CUDNN7.5.

Topic		Replies	Views
How can I query a limited-workspace algorithm with cudnnGetForwardAlgorithm_v7()? cuDNN	1	885	September 11, 2020
How do I use cudnn convolutions with cudnn 8.0? cuDNN	4	4373	September 8, 2020
Choosing Convolution Algo in cuDNN v2 GPU-Accelerated Libraries	0	5221	March 24, 2015
cudnnGetConvolutionForwardAlgorithm observation and suggested change. cuDNN	0	1497	October 24, 2018
cudnn dilated convolution low efficiency cuDNN	0	436	May 29, 2019
Workspacre size is zero cuDNN	3	1532	October 4, 2019
[cudnn bug] 3D Convolution failure when using large image size (GPU memory is okay) cuDNN	1	983	July 29, 2018
A build failure after upgrading to JetPack 4.4 from JetPack 4.4 DP Jetson AGX Xavier cuda , nvbugs	5	684	October 18, 2021
Cudnn can't use tensorcore cuDNN	0	767	March 16, 2023
CUDNN_STATUS_NOT_SUPPORTED for algo 6 and 3 cuDNN cudnn	1	1161	January 14, 2021

Cudnn8.0.4 Convolution Occupy High Memory

Related topics