CUDA Separable Convolution and FFT-Based 2D Convolution

Hi everyone,

Is there a performance comparison of CUDA separable convolution versus CUDA FFT-based 2D convolution anywhere on the web or on NVIDIA's website? I would like to implement a convolution function in my CUDA code, but I am not sure which approach would be the better choice.