In a kernel I want add two nunbers as followed:
double ttd=0.0; ...... ...... pttd[id]+=ttd;
(if ttd=0.0, erveryting work well; if I set ttd=1, the cufft will throw an error as followed:
cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.2/cufft/src/cufft.cu, line 140 cufft: ERROR: CUFFT_INVALID_VALUE error: FFT Execute failed : error: isign = 1 error: (N1, N2, N3) is (1, 224, 224) cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.2/cufft/src/plan.cu, line 73 cufft: ERROR: CUFFT_INTERNAL_ERROR
Who know why? can you tell me some possible reasons?
By the way, I didn’t run cufft in this function. I had found wherever there is an error, the cufft will throw the error. So I had used the cufft as a debug tool because the deviceemu and cugdb don’t work in my mutiGPU and multiThread program but the cufft always work.