I am using cufft.h to comput fft. My requirement is to compute fft of four different sequence at the same time using streams. Earlier I used this library to compute fft of one sequence and it worked but now it is throwing errors when I am calling the cufft functions from the kernel. The error thrown is host function cannot be called from global function.
How can I compute the fft from the kernel when I dont want to copy data from device to host and then calculate fft and then copy it back to device for further calculation.
Please help
Thanks in advance