Thank you for the source code for CUFFT and CUBLAS. I am working on a project that requires me to modify the CUFFT source so that it runs on streams and also allows data overlap. It is a proof of concept to analyze whether the NVIDIA cards can handle the workload we need in our application.
I notice by running CUFFT code in the profiler that not all the source for CUFFT is provided. For example, there are routines such as c2c_radix2_mpsm and c2c_radix2_mpgm that show up in the profiler and not in the source release. Also routines such as c2c_twiddle and c2c_transpose are not included. The source for host code to determine the cufftPlan would be extemely useful also.
Is there any plan to release the full source for CUFFT anytime soon ?