I am trying to speed up the digital signal processing of an optical coherence tomography setup using CUDA. I am quite new to CUDA,
so your advice would be greatly appreciated.
Basically, a CCD generates a 2D array (about 1024 x 640) of values at about 29 Hz. The values in each row (1024 rows of 512 elements each) need to be interpolated in 1D and then Fourier transformed, so the same series of operations has to be performed 1024 times per frame. My questions are:
- Is this a problem that CUDA could help with?
- Should I have the GPU compute the interpolation and FFT row by row, or for all rows at once? (General programming strategy)
- Do you know of any decent GPU implementations of 1D interpolation?
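To make the per-row work concrete, here is a minimal CPU reference sketch in Python of what I mean. It is only an illustration under assumptions: the function names (`interp_row`, `dft`, `process_frame`) are hypothetical, I assume plain linear interpolation at given fractional sample positions, and the naive DFT stands in for a real FFT.

```python
import cmath


def interp_row(row, positions):
    """Linearly interpolate `row` at fractional sample positions.

    Linear interpolation is an assumption for this sketch; the real
    setup may need a different scheme (e.g. cubic or spline).
    """
    last = len(row) - 1
    out = []
    for x in positions:
        x = min(max(x, 0.0), float(last))  # clamp to the valid index range
        i = min(int(x), last - 1)          # index of the left neighbour
        t = x - i                          # fractional distance to it
        out.append((1.0 - t) * row[i] + t * row[i + 1])
    return out


def dft(samples):
    """Naive O(n^2) DFT, a stand-in for the FFT in this sketch."""
    n = len(samples)
    return [
        sum(samples[k] * cmath.exp(-2j * cmath.pi * m * k / n) for k in range(n))
        for m in range(n)
    ]


def process_frame(frame, positions):
    """Apply the same interpolate-then-transform steps to every row."""
    return [dft(interp_row(row, positions)) for row in frame]


if __name__ == "__main__":
    # Tiny stand-in for one camera frame: 2 rows of 4 samples each.
    frame = [[0.0, 1.0, 2.0, 3.0], [1.0, 1.0, 1.0, 1.0]]
    positions = [0.0, 1.5, 3.0]  # hypothetical resampling positions
    spectra = process_frame(frame, positions)
    print(len(spectra), "rows,", len(spectra[0]), "bins per row")
```

No row depends on any other row in this sketch, which is why I am wondering whether all rows can be handed to the GPU in one go.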
Thanks in advance!
PS: In case it matters, I am trying to implement this in LabVIEW.