i have a 4096 samples array to apply FFT on it. batching the array will improve speed? is it like dividing the FFT in small DFTs and computes the whole FFT?
i don’t quite understand the use of the batch, and didn’t find explicit documentation on it… i think it might be two things, either:
divide one FFT calculation in parallel DFTs to speed up the process
calculate one FFT x times and average it for the result
both might be wrong ^^
anyone care to explain? maybe show me an explained example?
with batch=1 the FFTs take so much more time than IPP :\ i wanted to speed this up… (now it’s like 3 seconds IPP, 20 seconds CUFFT; 4096 samples C2C, 10000 1D FFTs, without magnitude calculation).
i don’t know if batching is the answer for it though…