Problem fusing Convolution and BatchNorm

CUDA environment:

  • cuDNN version: 8005
  • cudaGetDeviceProperties::major: 7
  • cudaGetDeviceProperties::minor: 5

Problem:

I am trying to fuse Convolution and BatchNorm with CUDNN_FUSED_SCALE_BIAS_ACTIVATION_CONV_BNSTATS. The following APIs are the main ones involved (a minimal call-chain sketch follows the list):

  • cudnnCreateFusedOpsPlan()
  • cudnnCreateFusedOpsConstParamPack()
  • cudnnSetFusedOpsConstParamPackAttribute()
  • cudnnMakeFusedOpsPlan()
  • cudnnCreateFusedOpsVariantParamPack()
  • cudnnSetFusedOpsVariantParamPackAttribute()
  • cudnnFusedOpsExecute()
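
For context, here is a minimal sketch of how I chain these calls together. It assumes the descriptors and device buffers (xDesc, wDesc, yDesc, yStatsDesc, the data pointers, and the workspace) are created and allocated by the caller; error checking and the optional eqScale/eqBias/activation inputs are omitted, so this is an illustration rather than complete code.

```cpp
#include <cudnn.h>

// Sketch of the fused-ops call chain for CUDNN_FUSED_SCALE_BIAS_ACTIVATION_CONV_BNSTATS.
// Descriptors/buffers are assumed to be set up elsewhere; error checking omitted.
void FusedConvBnStatsForward(cudnnHandle_t handle,
                             cudnnTensorDescriptor_t xDesc,       // NHWC, CUDNN_DATA_HALF
                             cudnnFilterDescriptor_t wDesc,       // NHWC, CUDNN_DATA_HALF
                             cudnnTensorDescriptor_t yDesc,       // NHWC, CUDNN_DATA_HALF
                             cudnnTensorDescriptor_t yStatsDesc,  // 1xKx1x1, CUDNN_DATA_FLOAT
                             cudnnConvolutionDescriptor_t convDesc,
                             void *x, void *w, void *y,
                             void *ySum, void *ySqSum,
                             void *workspace, size_t workspaceSize) {
  const cudnnFusedOps_t op = CUDNN_FUSED_SCALE_BIAS_ACTIVATION_CONV_BNSTATS;

  cudnnFusedOpsPlan_t plan;
  cudnnFusedOpsConstParamPack_t constPack;
  cudnnFusedOpsVariantParamPack_t varPack;
  cudnnCreateFusedOpsPlan(&plan, op);
  cudnnCreateFusedOpsConstParamPack(&constPack, op);
  cudnnCreateFusedOpsVariantParamPack(&varPack, op);

  // Const params: descriptors plus pointer placeholders that promise alignment.
  cudnnBatchNormMode_t bnMode = CUDNN_BATCHNORM_SPATIAL_PERSISTENT;  // spatial BN stats
  cudnnFusedOpsPointerPlaceHolder_t aligned = CUDNN_PTR_16B_ALIGNED;
  cudnnSetFusedOpsConstParamPackAttribute(constPack, CUDNN_PARAM_BN_MODE, &bnMode);
  cudnnSetFusedOpsConstParamPackAttribute(constPack, CUDNN_PARAM_XDESC, xDesc);
  cudnnSetFusedOpsConstParamPackAttribute(constPack, CUDNN_PARAM_XDATA_PLACEHOLDER, &aligned);
  cudnnSetFusedOpsConstParamPackAttribute(constPack, CUDNN_PARAM_WDESC, wDesc);
  cudnnSetFusedOpsConstParamPackAttribute(constPack, CUDNN_PARAM_WDATA_PLACEHOLDER, &aligned);
  cudnnSetFusedOpsConstParamPackAttribute(constPack, CUDNN_PARAM_CONV_DESC, convDesc);
  cudnnSetFusedOpsConstParamPackAttribute(constPack, CUDNN_PARAM_YDESC, yDesc);
  cudnnSetFusedOpsConstParamPackAttribute(constPack, CUDNN_PARAM_YDATA_PLACEHOLDER, &aligned);
  cudnnSetFusedOpsConstParamPackAttribute(constPack, CUDNN_PARAM_YSTATS_DESC, yStatsDesc);
  cudnnSetFusedOpsConstParamPackAttribute(constPack, CUDNN_PARAM_YSUM_PLACEHOLDER, &aligned);
  cudnnSetFusedOpsConstParamPackAttribute(constPack, CUDNN_PARAM_YSQSUM_PLACEHOLDER, &aligned);

  // Finalize the plan; cuDNN reports the required workspace size here.
  size_t requiredWorkspace = 0;
  cudnnMakeFusedOpsPlan(handle, plan, constPack, &requiredWorkspace);

  // Variant params: bind the actual device pointers, then execute.
  cudnnSetFusedOpsVariantParamPackAttribute(varPack, CUDNN_PTR_XDATA, x);
  cudnnSetFusedOpsVariantParamPackAttribute(varPack, CUDNN_PTR_WDATA, w);
  cudnnSetFusedOpsVariantParamPackAttribute(varPack, CUDNN_PTR_YDATA, y);
  cudnnSetFusedOpsVariantParamPackAttribute(varPack, CUDNN_PTR_YSUM, ySum);
  cudnnSetFusedOpsVariantParamPackAttribute(varPack, CUDNN_PTR_YSQSUM, ySqSum);
  cudnnSetFusedOpsVariantParamPackAttribute(varPack, CUDNN_PTR_WORKSPACE, workspace);
  cudnnSetFusedOpsVariantParamPackAttribute(varPack,
      CUDNN_SCALAR_SIZE_T_WORKSPACE_SIZE_IN_BYTES, &workspaceSize);

  cudnnFusedOpsExecute(handle, plan, varPack);

  cudnnDestroyFusedOpsVariantParamPack(varPack);
  cudnnDestroyFusedOpsConstParamPack(constPack);
  cudnnDestroyFusedOpsPlan(plan);
}
```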

When I use cudnnSetFusedOpsConstParamPackAttribute() to set the parameters according to [Conditions for Fully Fused Fast Path (Forward)], the code runs correctly.
The API documentation says: “As of cuDNN 7.6.0, if the conditions in Table 26 are met, then the fully fused fast path will be triggered. Otherwise, a slower partially fused path will be triggered.”
However, if the input data type is CUDNN_DATA_FLOAT instead of CUDNN_DATA_HALF, the code fails with CUDNN_STATUS_BAD_PARAM. A descriptor-level sketch of the working configuration is shown below.
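
The shape values below (N=32, C=64, K=64, 3x3 filter) are hypothetical and only chosen for illustration; what matters is the layout and data-type choices that the fast-path table asks for. Replacing CUDNN_DATA_HALF with CUDNN_DATA_FLOAT in these tensor/filter descriptors is what produces the CUDNN_STATUS_BAD_PARAM in my case.

```cpp
cudnnTensorDescriptor_t xDesc;
cudnnCreateTensorDescriptor(&xDesc);
// Input: NHWC layout, FP16 data, channel count a multiple of 8.
cudnnSetTensor4dDescriptor(xDesc, CUDNN_TENSOR_NHWC, CUDNN_DATA_HALF,
                           /*n=*/32, /*c=*/64, /*h=*/56, /*w=*/56);

cudnnFilterDescriptor_t wDesc;
cudnnCreateFilterDescriptor(&wDesc);
// Filter: also NHWC/FP16, output channels a multiple of 32.
cudnnSetFilter4dDescriptor(wDesc, CUDNN_DATA_HALF, CUDNN_TENSOR_NHWC,
                           /*k=*/64, /*c=*/64, /*h=*/3, /*w=*/3);

cudnnConvolutionDescriptor_t convDesc;
cudnnCreateConvolutionDescriptor(&convDesc);
// Cross-correlation mode with FP32 accumulation and Tensor Core math.
cudnnSetConvolution2dDescriptor(convDesc, /*pad_h=*/1, /*pad_w=*/1,
                                /*u=*/1, /*v=*/1, /*dilation_h=*/1, /*dilation_w=*/1,
                                CUDNN_CROSS_CORRELATION, CUDNN_DATA_FLOAT);
cudnnSetConvolutionMathType(convDesc, CUDNN_TENSOR_OP_MATH);
```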

How should I choose the arguments when the conditions in [Conditions for Fully Fused Fast Path (Forward)] are not met? How do I set cudnnTensorFormat_t, etc., so that cuDNN supports Conv-BN fusion when cudnnDataType_t == CUDNN_DATA_FLOAT?