I’m trying to use cuFFT, but have run into a problem. I need to compute many cross-correlations, and I do this using 2D FFTs. The program is compiled with OpenMP support.
If I run the program with only one thread, everything is fine. If I use more threads, then at some point two plans get created with identical handles, and
the cuFFT library starts producing error messages. So far I’ve tried adding omp critical sections (locking) and creating a unique stream for each plan, but nothing works.
Note that cuFFT is only pulled in through its .h header; the rest of the program is completely unaware of CUDA and is compiled with g++.
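To make the setup concrete, here is a minimal sketch of the locking pattern together with a check that no two threads end up with the same handle. It is not my actual code: `PlanHandle`, `create_plan`, `collect_handles` and `all_handles_unique` are hypothetical names, and `create_plan` is only a stand-in for the real plan creation call, so the sketch compiles with plain g++ (with or without -fopenmp) and needs no CUDA.

```cpp
#include <cstddef>
#include <set>
#include <vector>

// Hypothetical stand-in for the plan handle type (cufftHandle in the real code).
using PlanHandle = int;

// Hypothetical stand-in for plan creation. In the real program this is where
// cufftPlan2d would be called; here a plain counter hands out handles.
// Like the real call in my case, it is assumed NOT to be thread-safe on its own.
PlanHandle create_plan() {
    static PlanHandle next = 0;
    return ++next;
}

// Create one plan per loop iteration, serializing creation with a named
// OpenMP critical section so only one thread is inside create_plan at a time.
std::vector<PlanHandle> collect_handles(int n_workers) {
    std::vector<PlanHandle> handles(static_cast<std::size_t>(n_workers));
    #pragma omp parallel for
    for (int i = 0; i < n_workers; ++i) {
        PlanHandle h = 0;
        #pragma omp critical(plan_creation)
        {
            h = create_plan();
        }
        handles[static_cast<std::size_t>(i)] = h;  // each iteration writes its own slot
    }
    return handles;
}

// Diagnostic: true if every handle in the list is distinct.
bool all_handles_unique(const std::vector<PlanHandle>& handles) {
    std::set<PlanHandle> seen(handles.begin(), handles.end());
    return seen.size() == handles.size();
}
```

With the stand-in, `all_handles_unique(collect_handles(8))` holds because creation is serialized. In my real program the equivalent check fails once more than one thread is used, which is the behaviour I can’t explain.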
Any suggestions?
Many thanks in advance,
P.S. I posted this earlier on http://forums.nvidia.com/index.php?showtop…2639&hl=fft, but I guess this forum is more appropriate, so I’m re-posting here.