In my code, I need to implement a 1D FFT algorithm that runs efficiently on the GPU. Where can I find such an implementation? Is the source code of the cuFFT library available, for example?
I want to run the FFT and further operations in the same kernel, but cuFFT library functions can't be called from inside a kernel, so I figured I need to implement the FFT myself. Is there a better solution?
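To make concrete what I mean by "FFT and further operations in the same kernel", here is a minimal sketch of the kind of thing I would otherwise have to write myself. It is a plain radix-2 FFT done by one thread block in shared memory, followed by a squared-magnitude step as a stand-in for the "further operations". It assumes the length is a fixed power of two (`N = 256`) that fits in shared memory and that the block has exactly `N / 2` threads. The names `block_fft`, `fft_then_power` and `run_power_spectrum` are made up for illustration and are not part of cuFFT, and the code is not tuned for performance.

```cuda
#include <cuda_runtime.h>
#include <cuComplex.h>
#include <cassert>
#include <vector>

constexpr int N = 256;       // FFT length: assumed power of two, fits in shared memory
constexpr int LOG2_N = 8;    // log2(N)

// In-place radix-2 decimation-in-time FFT over shared memory.
// Assumes s[] is already in bit-reversed order and blockDim.x == N / 2,
// so each thread handles exactly one butterfly per stage.
__device__ void block_fft(cuFloatComplex* s)
{
    const int t = threadIdx.x;
    for (int half = 1; half < N; half <<= 1) {
        const int k = t & (half - 1);        // position inside the butterfly group
        const int i = ((t - k) << 1) + k;    // upper element of the butterfly
        const int j = i + half;              // lower element of the butterfly
        float sn, cs;
        sincosf(-3.14159265358979f * k / half, &sn, &cs);  // twiddle exp(-i*pi*k/half)
        const cuFloatComplex u = s[i];
        const cuFloatComplex v = cuCmulf(make_cuFloatComplex(cs, sn), s[j]);
        s[i] = cuCaddf(u, v);
        s[j] = cuCsubf(u, v);
        __syncthreads();                     // finish this stage before the next one
    }
}

// One block transforms one length-N signal, then applies a further
// operation (here: squared magnitude) without leaving the kernel.
__global__ void fft_then_power(const cuFloatComplex* in, float* out)
{
    __shared__ cuFloatComplex s[N];
    const cuFloatComplex* x = in + blockIdx.x * N;

    // Load the input into bit-reversed positions (two elements per thread).
    for (int n = threadIdx.x; n < N; n += blockDim.x)
        s[__brev(n) >> (32 - LOG2_N)] = x[n];
    __syncthreads();

    block_fft(s);

    // The "further operations" stage, fused into the same kernel.
    for (int n = threadIdx.x; n < N; n += blockDim.x)
        out[blockIdx.x * N + n] = s[n].x * s[n].x + s[n].y * s[n].y;
}

// Host helper (hypothetical): runs the kernel on a single length-N signal.
std::vector<float> run_power_spectrum(const std::vector<cuFloatComplex>& h_in)
{
    assert(h_in.size() == static_cast<size_t>(N));
    cuFloatComplex* d_in = nullptr;
    float* d_out = nullptr;
    cudaMalloc(&d_in, N * sizeof(cuFloatComplex));
    cudaMalloc(&d_out, N * sizeof(float));
    cudaMemcpy(d_in, h_in.data(), N * sizeof(cuFloatComplex), cudaMemcpyHostToDevice);

    fft_then_power<<<1, N / 2>>>(d_in, d_out);   // one block, N/2 threads

    std::vector<float> h_out(N);
    cudaMemcpy(h_out.data(), d_out, N * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(d_in);
    cudaFree(d_out);
    return h_out;
}
```

Calling `run_power_spectrum` with a unit impulse should give a flat spectrum, which is a quick sanity check. What I don't know is whether hand-writing something like this is really the best option, or whether there is an existing implementation I can call from device code.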