First of all, I want you to know that I'm fairly new to CUDA.
I'm developing in C/C++ and running some tests with CUDA, especially with cuFFT.
I have several questions and I hope you’ll be able to help me.
I saw that cuFFT functions (cufftExecC2C, etc.) can't be called from the device. Can someone confirm this? And is there any FFT function that can be called from the device?
I'm working on a cross-correlation function. I divide the two pictures (on which I will compute the cross-correlation) into small sub-pictures.
Then, on each sub-picture, I compute the convolution (FFT -> multiplication -> inverse FFT).
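To make the per-sub-picture step concrete, here is a minimal CPU-only sketch in C++. It is an illustration under assumptions, not the actual code: the names `dft1d`, `dft2d` and `crossCorrelateTile` are hypothetical, it uses a naive DFT instead of cuFFT or a real FFT, and it multiplies by the complex conjugate of the first spectrum, which gives a circular cross-correlation rather than a plain convolution.

```cpp
#include <cassert>
#include <cmath>
#include <complex>
#include <cstddef>
#include <vector>

using cpx = std::complex<double>;

// Naive O(n^2) DFT of a 1-D signal; the inverse applies the 1/n scaling.
std::vector<cpx> dft1d(const std::vector<cpx>& in, bool inverse) {
    const std::size_t n = in.size();
    const double pi = std::acos(-1.0);
    const double sign = inverse ? 1.0 : -1.0;
    std::vector<cpx> out(n);
    for (std::size_t k = 0; k < n; ++k) {
        cpx sum(0.0, 0.0);
        for (std::size_t t = 0; t < n; ++t) {
            const double angle = sign * 2.0 * pi * double(k * t) / double(n);
            sum += in[t] * cpx(std::cos(angle), std::sin(angle));
        }
        out[k] = inverse ? sum / double(n) : sum;
    }
    return out;
}

// 2-D DFT of an n x n row-major tile: transform every row, then every column.
std::vector<cpx> dft2d(std::vector<cpx> tile, std::size_t n, bool inverse) {
    std::vector<cpx> line(n);
    for (std::size_t r = 0; r < n; ++r) {
        for (std::size_t c = 0; c < n; ++c) line[c] = tile[r * n + c];
        const std::vector<cpx> f = dft1d(line, inverse);
        for (std::size_t c = 0; c < n; ++c) tile[r * n + c] = f[c];
    }
    for (std::size_t c = 0; c < n; ++c) {
        for (std::size_t r = 0; r < n; ++r) line[r] = tile[r * n + c];
        const std::vector<cpx> f = dft1d(line, inverse);
        for (std::size_t r = 0; r < n; ++r) tile[r * n + c] = f[r];
    }
    return tile;
}

// Circular cross-correlation of two real n x n tiles:
// out[dy * n + dx] = sum over (y, x) of a[y][x] * b[(y + dy) % n][(x + dx) % n].
std::vector<double> crossCorrelateTile(const std::vector<double>& a,
                                       const std::vector<double>& b,
                                       std::size_t n) {
    assert(a.size() == n * n && b.size() == n * n);
    std::vector<cpx> fa(a.begin(), a.end());
    std::vector<cpx> fb(b.begin(), b.end());
    fa = dft2d(fa, n, false);                      // forward transform of tile a
    fb = dft2d(fb, n, false);                      // forward transform of tile b
    for (std::size_t i = 0; i < n * n; ++i) {
        fa[i] = std::conj(fa[i]) * fb[i];          // pointwise multiplication
    }
    const std::vector<cpx> corr = dft2d(fa, n, true);  // inverse transform
    std::vector<double> out(n * n);
    for (std::size_t i = 0; i < n * n; ++i) out[i] = corr[i].real();
    return out;
}
```

For example, with 4*4 tiles where `a` has a single 1 at row 1, column 1 and `b` has a single 1 at row 2, column 3, the only non-zero value of the result sits at row 1, column 2, which is the shift between the two tiles. Dropping `std::conj` would turn the same pipeline into a circular convolution.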
Unfortunately, the sub-pictures are small (32*32). In the few tests I've run, the convolution on the GPU is slower than on the CPU, which is understandable given the size of the
image (but maybe I'm wrong and the problem is in my code). So my question is: with this picture size, can the convolution be faster on the GPU than on the CPU?
Thanks in advance for your help.
PS: Sorry for my poor English, I hope you'll understand my problem.