Hi everyone,
First things first: I want you to know that I'm fairly new to CUDA.
I'm developing in C/C++ and doing some tests with CUDA, especially with cuFFT.
I have several questions and I hope you’ll be able to help me.

I saw that cuFFT functions (cufftExecC2C, etc.) can't be called from the device. Can someone confirm this? And is there any FFT function that can be called from the device?

I'm working on a cross-correlation function. I divide the two pictures (on which I will compute the cross-correlation) into small subpictures.
Then, on each subpicture, I compute a convolution (FFT > multiplication > inverse FFT).
Unfortunately the subpictures are small (32*32). In the few tests I've made, the convolution is slower on the GPU than on the CPU, which is understandable given the size of the
image (but maybe I'm wrong and the problem is in my code). So my question is: can the convolution be faster on the GPU than on the CPU with this picture size?
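
To make the per-subpicture step (FFT > multiplication > inverse FFT) concrete, here is a minimal CPU-only sketch in plain C++. It is not my actual code, only an illustration under some assumptions: square tiles whose side is a power of two (such as 32), circular (wrap-around) correlation, and a hand-written radix-2 FFT instead of cuFFT. The names fft1d, fft2d and crossCorrelate are hypothetical and made up for this example.

```cpp
#include <cassert>
#include <cmath>
#include <complex>
#include <cstddef>
#include <utility>
#include <vector>

using cpx = std::complex<double>;

// In-place iterative radix-2 FFT over n elements spaced `stride` apart.
// n must be a power of two. inverse = true gives the unnormalised inverse.
void fft1d(cpx* a, std::size_t n, std::size_t stride, bool inverse) {
    assert(n > 0 && (n & (n - 1)) == 0);
    // Bit-reversal permutation.
    for (std::size_t i = 1, j = 0; i < n; ++i) {
        std::size_t bit = n >> 1;
        for (; j & bit; bit >>= 1) j ^= bit;
        j ^= bit;
        if (i < j) std::swap(a[i * stride], a[j * stride]);
    }
    const double pi = std::acos(-1.0);
    // Butterfly stages of length 2, 4, ..., n.
    for (std::size_t len = 2; len <= n; len <<= 1) {
        const double ang = (inverse ? 2.0 : -2.0) * pi / static_cast<double>(len);
        const cpx wlen(std::cos(ang), std::sin(ang));
        for (std::size_t i = 0; i < n; i += len) {
            cpx w(1.0, 0.0);
            for (std::size_t k = 0; k < len / 2; ++k) {
                const cpx u = a[(i + k) * stride];
                const cpx v = a[(i + k + len / 2) * stride] * w;
                a[(i + k) * stride] = u + v;
                a[(i + k + len / 2) * stride] = u - v;
                w *= wlen;
            }
        }
    }
}

// 2D FFT of an n x n row-major tile: transform every row, then every column.
void fft2d(std::vector<cpx>& img, std::size_t n, bool inverse) {
    for (std::size_t r = 0; r < n; ++r) fft1d(&img[r * n], n, 1, inverse);
    for (std::size_t c = 0; c < n; ++c) fft1d(&img[c], n, n, inverse);
}

// Circular cross-correlation of two real n x n tiles:
//   result[d] = sum over x of a[x] * b[x + d], indices taken modulo n.
// Removing std::conj below would give a circular convolution instead.
std::vector<double> crossCorrelate(const std::vector<double>& a,
                                   const std::vector<double>& b,
                                   std::size_t n) {
    assert(a.size() == n * n && b.size() == n * n);
    std::vector<cpx> fa(a.begin(), a.end());
    std::vector<cpx> fb(b.begin(), b.end());
    fft2d(fa, n, false);                      // FFT of both tiles
    fft2d(fb, n, false);
    for (std::size_t i = 0; i < n * n; ++i)   // pointwise multiplication
        fa[i] = std::conj(fa[i]) * fb[i];
    fft2d(fa, n, true);                       // inverse FFT
    std::vector<double> out(n * n);
    const double scale = 1.0 / static_cast<double>(n * n);
    for (std::size_t i = 0; i < n * n; ++i)   // normalise, keep the real part
        out[i] = fa[i].real() * scale;
    return out;
}
```

Calling crossCorrelate(a, b, 32) on two 32*32 tiles returns a 32*32 correlation map; the position of its maximum is the circular shift between the two tiles. On the GPU I would expect the same three steps to be done for many subpictures at once rather than one by one, but that is an assumption on my side.
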
Thanks in advance for your help.
PS: Sorry for my poor English, I hope you'll understand my problems.