I’m developing on NVIDIA’s Xavier. I plan to implement an FFT using CUDA, then profile it and check the performance with NVIDIA Visual Profiler.
I have two questions.
1. About the FFT results in nvprof
LEN_X: 256
LEN_Y: 64
I have 256x64 complex data, and I use 2D cuFFT to transform it. In the profile, two FFT kernels appear: void_regular_fft (…) and void_vector_fft (…).
Since the transform is two-dimensional, does this mean the FFT was performed separately in the X and Y directions?
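For context on the question above: a 2D DFT is mathematically separable, so it can be computed as 1D transforms along one axis followed by 1D transforms along the other. Seeing two kernels would be consistent with that, but whether cuFFT's two kernels map one-to-one onto these passes is an assumption here, not something the profile alone confirms. Below is a minimal CPU-side C++ sketch of the row-column decomposition using a naive DFT. It is not cuFFT's implementation, and all function names (dft1d, dft2d_row_column, dft2d_direct, max_abs_diff) are hypothetical helpers for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <complex>
#include <cstddef>
#include <vector>

using cd = std::complex<double>;

// Naive O(n^2) 1D DFT, in place, over n elements that start at `base`
// and are spaced `stride` apart in `data`.
void dft1d(std::vector<cd>& data, std::size_t base, std::size_t stride, std::size_t n) {
    const double pi = std::acos(-1.0);
    std::vector<cd> out(n);
    for (std::size_t k = 0; k < n; ++k) {
        cd sum(0.0, 0.0);
        for (std::size_t j = 0; j < n; ++j) {
            double angle = -2.0 * pi * double((k * j) % n) / double(n);
            sum += data[base + j * stride] * std::polar(1.0, angle);
        }
        out[k] = sum;
    }
    for (std::size_t k = 0; k < n; ++k) data[base + k * stride] = out[k];
}

// 2D DFT as two passes over row-major data, where element (x, y)
// lives at index x * lenY + y.
void dft2d_row_column(std::vector<cd>& data, std::size_t lenX, std::size_t lenY) {
    // Pass 1: lenX transforms of length lenY (contiguous elements).
    for (std::size_t x = 0; x < lenX; ++x) dft1d(data, x * lenY, 1, lenY);
    // Pass 2: lenY transforms of length lenX (strided elements).
    for (std::size_t y = 0; y < lenY; ++y) dft1d(data, y, lenY, lenX);
}

// Direct evaluation of the 2D DFT definition, used only as a reference.
std::vector<cd> dft2d_direct(const std::vector<cd>& in, std::size_t lenX, std::size_t lenY) {
    const double pi = std::acos(-1.0);
    std::vector<cd> out(lenX * lenY);
    for (std::size_t kx = 0; kx < lenX; ++kx) {
        for (std::size_t ky = 0; ky < lenY; ++ky) {
            cd sum(0.0, 0.0);
            for (std::size_t x = 0; x < lenX; ++x) {
                for (std::size_t y = 0; y < lenY; ++y) {
                    double angle = -2.0 * pi * (double(kx * x) / double(lenX) +
                                                double(ky * y) / double(lenY));
                    sum += in[x * lenY + y] * std::polar(1.0, angle);
                }
            }
            out[kx * lenY + ky] = sum;
        }
    }
    return out;
}

// Largest element-wise absolute difference between two equally sized arrays.
double max_abs_diff(const std::vector<cd>& a, const std::vector<cd>& b) {
    double m = 0.0;
    for (std::size_t i = 0; i < a.size(); ++i) m = std::fmax(m, std::abs(a[i] - b[i]));
    return m;
}
```

Calling dft2d_row_column on a small array (for example 8x4) and comparing against dft2d_direct with max_abs_diff should give agreement up to floating-point rounding. The direct version is far too slow for 256x64 and is only there as a check.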
2. About the arguments in ()
void_regular_fft (unsigned int = 64, unsigned int = 1, unsigned int = 8, unsigned int = 8, unsigned int = 0, unsigned int = 0, unsigned int = 2, unsigned int = 1, unsigned int = 1, unsigned int = 0, unsigned int, double> (kernel_arguments_t )
What do the arguments in () of void_regular_fft () above mean?
If you are not already using cuFFT, we recommend doing so, since it is a CUDA-accelerated library built specifically for FFT.
If cuFFT is not an option, could you share which library you are using so that we can give further suggestions?
(We could not find the corresponding void_regular_fft and void_vector_fft in our cuFFT documentation.)
Thank you for your response.
Yes, I’m using cuFFT.
I will paste part of the source code and the result of profiling it with nvprof.
Please take a look.
The 256x64 input data is read from an Excel file (omitted) and then transformed with cuFFT.