I’m wondering what are the differences between
Initially I thought
cudnnNormalizationForwardTraining() would perform more general normalization than batch normalization, but its documentation seems to suggest that it performs just batch normalization. Also, the documentation indicates that both functions are based on the same paper entitled “Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift”.
What is the critical difference between the two functions?