Variations in heuristics for selecting conv algorithm in different APIs

YashasSamaga · June 30, 2020, 1:22pm

Background:

Frameworks have been using cudnnGetConvolutionForwardAlgorithm until cuDNN 8. cuDNN 8 removed the aforementioned API which made users switch to the _v7 suffixed API (henceforth referred to as the v7 API in this post). In an attempt to avoid conditionally compiled code for cuDNN 7 and 8, I tried to use the v7 API in cuDNN 7. Unfortunately, they don’t seem to be interchangeable.

The main cause of the discrepancy is that v7 API returns WINOGRAD_NONFUSED for some situations when the non-v7 API does not. Based on limited tests on hundreds of convolution configurations, it appears that the non-v7 API does never returns WINOGRAD_NONFUSED. I verified the results from the two APIs against autotuned results. The v7 API’s heuristics appear to agree better with the autotuned results.

Question:

TensorFlow’s cuDNN 8 PR skips WINOGRAD_NONFUSED while selecting an algorithm returned by the v7 API. Why does it do so?

Is there any advice on how to move from the non-v7 API to the v7 API? It naturally feels like directly switching to the v7 API is the right way to go but the TF PR makes it questionable.

YashasSamaga · June 30, 2020, 2:13pm

Removing WINOGRAD_NONFUSED from the v7 API’s results and selecting the best algorithm gives the same result as the non-v7 API.

So the question now is should WINOGRAD_NONFUSED be ignored or allowed?

AakankshaS · June 30, 2020, 6:36pm

Hi @YashasSamaga,

Is there a typo and you meant does not?
For that matter, the reason some framework choose to remove “WINOGRAD_NONFUSED” is because sometimes it does not produce very good numerical precision. However we have not head it causing huge convergence problems either. It’s a per-framework decision to skip it or not.

For general usage, it would be safer to imitate the old TF behavior and skip it to avoid any surprise in existing working scripts. if you want performance and has more tolerance on convergence issues, you can choose to allow it.

Thanks!

Topic		Replies	Views
How do I use cudnn convolutions with cudnn 8.0? cuDNN	4	4436	September 8, 2020
cuDNN8 regression in algorithm selection heuristics cuDNN	6	2808	April 24, 2021
CUDNN_STATUS_LICENSE_ERROR when using CUDNN_CONVOLUTION_FWD_ALGO_WINOGRAD GPU-Accelerated Libraries	0	891	June 8, 2016
cuDNN: Problems finding conv forward algorithm cuDNN	4	1210	October 12, 2021
Choosing Convolution Algo in cuDNN v2 GPU-Accelerated Libraries	0	5241	March 24, 2015
peculiar return values of cudnnGetConvolutionForwardAlgorithm function GPU-Accelerated Libraries	0	1107	November 7, 2016
cudnnGetConvolutionForwardAlgorithm observation and suggested change. cuDNN	0	1516	October 24, 2018
cuDNN 5.1 delivers 2.7x Faster Training of Networks with 3x3 Convolutions Announcements	0	2097	August 12, 2016
How can I query a limited-workspace algorithm with cudnnGetForwardAlgorithm_v7()? cuDNN	1	911	September 11, 2020
Is it possible to get or set CUDNN convolution algorithm on Pytorch? cuDNN pytorch , jupyterlab	1	1788	March 17, 2021

Variations in heuristics for selecting conv algorithm in different APIs

Related topics