Converting ConvolutionFFT2D to ConvolutionFFT3D

wanderine · January 16, 2009, 10:50pm

Hello, I want to convert the example code ConvolutionFFT2D to ConvolutionFFT3D, i.e. perform 3D FFT convolution in CUDA. This is the first time I program in CUDA.

Most of the code is straight forward to change to 3D from 2D, but I got some problems.

I’m a bit confused about the memory allocation, why is the memory for a_Kernel allocated with cudaMallocArray and d_PaddedKernel with cudaMalloc?

Is there a way to use cudaMallocArray with 3 dimensions instead of 2? I tried it but the compiler complained.

Why do I need to bind a texture to the memory allocated with cudaMallocArray? Because the texture memory is faster?

I changed tex2D to tex3D in the kernel-code but how do I bind to a 3D texture if I cannot use cudaMallocArray for 3D ?

Is the textures only used for the padding of the data, not for the actual FFT calculations?

printf("Allocating memory...\n");

		h_Kernel	   = (Complex *)malloc(KERNEL_SIZE);

		h_Data		 = (Complex *)malloc(DATA_SIZE);

		h_ResultCPU	= (Complex *)malloc(DATA_SIZE);

		h_ResultGPU	= (Complex *)malloc(FFT_SIZE);

		//cutilSafeCall( cudaMallocArray(&a_Kernel, &float2tex, KERNEL_W, KERNEL_H) );

		//cutilSafeCall( cudaMallocArray(&a_Data,   &float2tex,   DATA_W,   DATA_H) );

		cudaMalloc((void**)&a_Kernel, sizeof(cufftComplex)*KERNEL_W * KERNEL_H * KERNEL_D);

		cudaMalloc((void**)&a_Data, sizeof(cufftComplex)*DATA_W * DATA_H * DATA_D);

		cutilSafeCall( cudaMalloc((void **)&d_PaddedKernel, FFT_SIZE) );

		cutilSafeCall( cudaMalloc((void **)&d_PaddedData,   FFT_SIZE) );

	   

		printf("...copying input data and convolution kernel from host to CUDA arrays\n");

		cutilSafeCall( cudaMemcpyToArray(a_Kernel, 0, 0, h_Kernel, KERNEL_SIZE, cudaMemcpyHostToDevice) );

		cutilSafeCall( cudaMemcpyToArray(a_Data,   0, 0, h_Data,   DATA_SIZE,   cudaMemcpyHostToDevice) );

		printf("...binding CUDA arrays to texture references\n");

		cutilSafeCall( cudaBindTextureToArray(texKernel, a_Kernel) );

		cutilSafeCall( cudaBindTextureToArray(texData,   a_Data)   );

Topic		Replies	Views
binding texture with linear memory CUDA Programming and Performance	1	4296	April 21, 2007
Guide: cudaMalloc3D and cudaArray's CUDA Programming and Performance	0	19366	July 10, 2011
cudaBindTexture2D problem CUDA Programming and Performance	3	11767	August 3, 2010
Using 2d texture fetchs without binding to array Can it be done? CUDA Programming and Performance	5	3333	February 21, 2008
Using mapped D3D texture as cufft input CUDA Programming and Performance	0	1697	January 18, 2009
error: identifier "texKernel"/"texData" is undefined? CUDA Programming and Performance	1	6179	July 12, 2009
Using Textures CUDA Programming and Performance	10	21825	March 29, 2007
Why there is no cudaBindTexture3D? It would be nice to have this ... CUDA Programming and Performance	9	4667	December 12, 2009
cudaBindTexture2D Problem? can't find the reason for this bahavior CUDA Programming and Performance	2	1937	August 19, 2009
performance of cudaBindTextureToArray CUDA Programming and Performance	1	7837	July 5, 2007

Converting ConvolutionFFT2D to ConvolutionFFT3D

Related topics