I want to run an FFT on every row of an M×N matrix using the cuFFTDx library, but I’m not sure how to implement it.
Would the following approach work?
Define the description of a single-row FFT using the description operators, and add the Block() execution operator.
Set FFTsPerBlock to M (the number of rows).
Query the recommended parameters, such as elements_per_thread and shared_memory_size.
Use those parameters to execute the FFT M times in each thread (so that each thread computes a few elements of each row). I’m not sure how to implement this step at all.
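To make the question concrete, here is a rough sketch of how I understand the block-level pattern from the cuFFTDx block examples. It is a sketch under assumptions, not a verified solution: M = 64, N = 128, single-precision complex-to-complex, SM<800>, ElementsPerThread<8> and FFTsPerBlock<2> are example values I picked, and the names row_fft_kernel and run_row_ffts are made up. In this version each thread calls execute() only once, threadIdx.y selects the row inside the block, and the grid (M / FFTsPerBlock blocks) covers all M rows, instead of FFTsPerBlock being set to M.

```cuda
#include <cufftdx.hpp>
#include <cuda_runtime.h>
#include <cmath>
#include <vector>

using namespace cufftdx;

// Assumed example dimensions (hypothetical values).
constexpr unsigned int M = 64;   // number of rows
constexpr unsigned int N = 128;  // row length = FFT size

// Description of one row FFT. SM<800> is an assumption; match it to the GPU.
// ElementsPerThread and FFTsPerBlock may be omitted to get library defaults.
using FFT = decltype(Size<N>() + Precision<float>() + Type<fft_type::c2c>() +
                     Direction<fft_direction::forward>() +
                     ElementsPerThread<8>() + FFTsPerBlock<2>() +
                     SM<800>() + Block());
using complex_type = typename FFT::value_type;

// This sketch has no tail handling, so M must divide evenly into blocks.
static_assert(M % FFT::ffts_per_block == 0, "M must be a multiple of FFTsPerBlock");

__global__ void row_fft_kernel(complex_type* data) {
    extern __shared__ __align__(alignof(float4)) complex_type shared_mem[];
    complex_type thread_data[FFT::storage_size];

    // threadIdx.y picks the FFT inside this block; each FFT is one matrix row.
    const unsigned int row    = blockIdx.x * FFT::ffts_per_block + threadIdx.y;
    const unsigned int offset = row * N;

    // Each thread loads elements_per_thread values of its row, FFT::stride apart.
    unsigned int index = threadIdx.x;
    for (unsigned int i = 0; i < FFT::elements_per_thread; ++i) {
        if (index < N) thread_data[i] = data[offset + index];
        index += FFT::stride;
    }

    // One call per thread; the threads of the block cooperate on each row.
    FFT().execute(thread_data, shared_mem);

    // Store the results in place with the same layout.
    index = threadIdx.x;
    for (unsigned int i = 0; i < FFT::elements_per_thread; ++i) {
        if (index < N) data[offset + index] = thread_data[i];
        index += FFT::stride;
    }
}

// Hypothetical host helper: transforms an all-ones matrix and checks that
// every row has N in bin 0 and (approximately) zero in all other bins.
bool run_row_ffts() {
    std::vector<complex_type> host(M * N, complex_type{1.0f, 0.0f});
    const size_t bytes = host.size() * sizeof(complex_type);

    complex_type* dev = nullptr;
    if (cudaMalloc(&dev, bytes) != cudaSuccess) return false;
    cudaMemcpy(dev, host.data(), bytes, cudaMemcpyHostToDevice);

    // Opt in to the dynamic shared memory size the description asks for.
    cudaFuncSetAttribute(row_fft_kernel,
                         cudaFuncAttributeMaxDynamicSharedMemorySize,
                         FFT::shared_memory_size);

    // Grid covers all rows; block shape and shared memory come from the FFT type.
    row_fft_kernel<<<M / FFT::ffts_per_block, FFT::block_dim,
                     FFT::shared_memory_size>>>(dev);
    const bool launched = (cudaDeviceSynchronize() == cudaSuccess);

    cudaMemcpy(host.data(), dev, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dev);
    if (!launched) return false;

    for (unsigned int r = 0; r < M; ++r) {
        for (unsigned int k = 0; k < N; ++k) {
            const float expected = (k == 0) ? static_cast<float>(N) : 0.0f;
            if (std::fabs(host[r * N + k].x - expected) > 1e-3f) return false;
            if (std::fabs(host[r * N + k].y) > 1e-3f) return false;
        }
    }
    return true;
}
```

If I read the documentation correctly, FFT::suggested_ffts_per_block gives a recommended FFTsPerBlock value for a given description, which could replace the hard-coded 2. What I don't know is whether this is the intended way to process a whole matrix, or whether there is a better layout.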
Can someone help?