Problem with cufftPlan2d

HamidKh · May 8, 2017, 3:00pm

Hello everybody,

I am going to run 2D complex-to-complex cuFFT on NVIDIA K40c consisting of 12 GB memory. However, there is a problem with cufftPlan2d for some sizes. For instance, for a given size of X=Y=22912, it ends up with CUFFT_ALLOC_FAILED error. It is noteworthy that the command works very well with larger size of X=Y= 23040. That is, the error does not occur because of memory leakage.
I tried to estimate the required size for performing FFT on a 2D matrix of 2291222912 using cufftEstimate2d(). It is surprisingly returns zero workload size, but the output for 2304023040 is around 3.5 GB.

You can find the code following.

Regards,
Hamidreza

==================================================
void fft_kernel_gpu_single_fit(const int M, const int N, cufftComplex *A, int tSize)
{

printf("fft_kernel_gpu_single_fit: M %d N %d\n", M, N);

cufftComplex *gpudata;

if (cudaMalloc((void**)&gpudata, tSize * sizeof(cufftComplex)) != CUFFT_SUCCESS)
{
	fprintf(stderr, "cudaMalloc Error: Unable to alloc memory\n");
	return;
}

if (cudaMemcpy(gpudata, A, tSize * sizeof(cufftComplex), cudaMemcpyHostToDevice) != CUFFT_SUCCESS)
{
	fprintf(stderr, "cudaMemcpy Error: Unable to copy data from host to gpu\n");
	return;
}

cufftHandle plan;

/*int n[2] = {M, N};
if (cufftPlanMany(&plan, 2, n, NULL, 1, 0, NULL, 1, 0, CUFFT_C2C, 1) != CUFFT_SUCCESS)
{ 
	fprintf(stderr, "CUFFT Error: Unable to create plan\n");
	return;
}*/

if (cufftPlan2d(&plan, M, N, CUFFT_C2C) != CUFFT_SUCCESS)
{ 
	fprintf(stderr, "CUFFT Error: Unable to create plan\n");
	return;
}

if (cufftExecC2C(plan, gpudata, gpudata, CUFFT_FORWARD) != CUFFT_SUCCESS)
{
	fprintf(stderr, "CUFFT Error: Unable to execute plan\n");
	return;
}

if (cudaDeviceSynchronize() != cudaSuccess)
{
	fprintf(stderr, "Cuda Error: Failed to synchronize\n");
	return;
}

if (cudaMemcpy(A, gpudata, tSize * sizeof(cufftComplex), cudaMemcpyDeviceToHost) != CUFFT_SUCCESS)
{
	fprintf(stderr, "cudaMemcpy Error: Unable to copy data from gpu to host\n");
	return;
}

if (cudaDeviceSynchronize() != cudaSuccess)
{
	fprintf(stderr, "Cuda Error: Failed to synchronize\n");
	return;
}
	
cufftDestroy(plan);
cudaFree(gpudata);

}

HamidKh · May 9, 2017, 9:20am

I found that the problem occurs when the input size is a number which cannot be factorable into primes less than or equal to 127. However, according to the nvidia documentations, this condition is just for multi-gpus not single ones.

Topic		Replies	Views
CuFFT :: Invalid Plan CUDA Programming and Performance	2	3282	June 17, 2009
Is nx in cufftPlan1d function not suit some number? GPU-Accelerated Libraries	3	898	August 14, 2016
cufftPlan2d fails CUDA Programming and Performance	14	21188	September 17, 2007
cufft error (?) CUDA Programming and Performance	7	9159	March 5, 2012
cufft what is maximum size for 2D fft CUDA Programming and Performance	5	11275	August 19, 2013
cuFFT return zeros CUDA Programming and Performance	6	1929	May 14, 2011
allocation problem in cuFFT CUDA Programming and Performance	2	2639	September 16, 2009
Arbitrary sizes in cuFFT GPU-Accelerated Libraries	0	319	May 26, 2020
CUFFT: allocation error CUDA Programming and Performance	1	3066	December 3, 2008
[SOLVED] cuFFT not liking a given length (error 2), but will accept larger work GPU-Accelerated Libraries	5	941	July 2, 2019

Problem with cufftPlan2d

}

Related topics