cudaMemcpy3D memory duplication

francois_86 · April 4, 2013, 9:16am

Hi everybody,

I have an issue when I try to update the contents of a CUDA 3D array via cudaMemcpy3D.
Init: I create a CUDA 3D array, copy a device buffer ("d_out") into it, and then bind the array to a texture.

What I want is to update the contents of my array inside a for loop. Unfortunately, what I get is
a duplication of the content with an offset of width*height elements (which seems strange).
Here is a small piece of code:

for (int a = 0; a < 10; ++a)
{
    /* here some kernels reading the texture and
       writing into d_out */

    // update the content of the array with the modified "d_out",
    // but the result seems to be a duplication of d_out with a height*width offset
    // (checked by simply reading the texture)
    checkCuda(cudaMemcpy3D(&copyParams), pExec);
}

Thanks a lot for your help!

françois

Karan_Sharma · April 4, 2013, 10:17am

I cannot comment much without seeing the code.
Try calling cudaThreadSynchronize() (deprecated in favor of cudaDeviceSynchronize()) after your kernel.
It may be needed since you are doing a memcpy immediately after the kernel.

francois_86 · April 4, 2013, 11:40am

Thank you for your answer. Here are some details:

3D array and texture init:

// create 3D array
	cudaExtent extent = make_cudaExtent(W, H, Z);
	cudaChannelFormatDesc channelDesc = cudaCreateChannelDesc<float>();
	cudaArray* cu_array=0;
	checkCuda( cudaMalloc3DArray(&cu_array, &channelDesc, extent),pExec );

	// copy data to 3D array
	cudaMemcpy3DParms copyParams = {0};
	//memory pitch
	copyParams.srcPtr   = make_cudaPitchedPtr((void*)d_out,   extent.width*sizeof(float),extent.width , extent.height);
	copyParams.dstArray = cu_array;
	copyParams.extent   = extent;
	copyParams.kind     = cudaMemcpyDeviceToDevice;
	checkCuda( cudaMemcpy3D(&copyParams),pExec);
	
	// set texture parameters
	tex.normalized = false;                      // access with unnormalized texture coordinates
	tex.filterMode = cudaFilterModeLinear;      // linear interpolation
	tex.addressMode[0] = cudaAddressModeWrap;   // wrap texture coordinates (note: CUDA supports wrap only with normalized coordinates)
	tex.addressMode[1] = cudaAddressModeWrap;
	tex.addressMode[2] = cudaAddressModeWrap;

	// bind texture to array
	checkCuda(cudaBindTextureToArray(tex, cu_array, channelDesc),pExec);
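For reference, the source layout that cudaMemcpy3D assumes from a cudaPitchedPtr can be sketched in host-side C++. This only illustrates the addressing arithmetic and is not a diagnosis of the problem described here: PitchedLayout and pitchedOffset are hypothetical names, not part of the CUDA runtime. The sketch shows that consecutive z-slices sit pitch * ysize bytes apart, which for an unpadded float buffer is exactly width*height elements.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical host-side mirror of the layout fields in cudaPitchedPtr.
struct PitchedLayout {
    std::size_t pitch;  // bytes per row, including any padding
    std::size_t xsize;  // logical row width
    std::size_t ysize;  // number of rows per z-slice
};

// Byte offset of element (x, y, z) in a pitched volume whose elements
// are elemSize bytes each.
inline std::size_t pitchedOffset(const PitchedLayout& p, std::size_t elemSize,
                                 std::size_t x, std::size_t y, std::size_t z) {
    assert(x * elemSize < p.pitch && y < p.ysize);
    const std::size_t slicePitch = p.pitch * p.ysize;  // bytes per z-slice
    return z * slicePitch + y * p.pitch + x * elemSize;
}
```

With the parameters used above (pitch = W*sizeof(float), ysize = H), element (0, 0, 1) is expected at byte offset W*H*sizeof(float), so any kernel writing d_out would need to use the same z*W*H + y*W + x indexing for the copy to land where the texture reads expect it.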

And here is the for loop in main:

for (int a = 0; a < nbangle; ++a)
	{
		float theta_r = -buffAngTilt[a] * (PI/180);

		checkCuda( cudaMemset(d_out, 0, size_f), pExec); // re-init

		// 1st kernel
		transformKernel<<<dimGrid, dimBlock>>>(d_out, // output; input data is in tex
		                                       W,
		                                       H,
		                                       Z,
		                                       theta_r);

		cudaError_t err = cudaGetLastError();
		if( cudaSuccess != err) {
			fprintf(pExec, " :CheckMsg() CUDA error :  : (%d) %s.\n", (int)err, cudaGetErrorString( err ) );
		}

		// 2nd kernel
		REprojectionKernel<<<dimGrid_REproj, dimBlock_REproj>>>(d_vol_proj, // in
		                                                        d_out,      // out
		                                                        W,
		                                                        H,
		                                                        Z,
		                                                        a);
		err = cudaGetLastError();
		if( cudaSuccess != err) {
			fprintf(pExec, " :CheckMsg() CUDA error :  : (%d) %s.\n", (int)err, cudaGetErrorString( err ) );
		}

		// update array content
		checkCuda( cudaMemcpy3D(&copyParams),pExec);
	}

françois
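One way to narrow down where the duplication appears is to copy d_out (or the array contents) back to the host after an iteration and test for repeated slices. The helper below is a minimal host-side C++ sketch under the assumption that "duplication with a width*height offset" means a z-slice is identical to the one before it; countRepeatedSlices is a hypothetical name, and the volume is assumed to be a dense, unpadded W*H*Z float buffer.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical debugging helper: counts z-slices that are bit-for-bit
// equal in value to the slice immediately before them.
inline std::size_t countRepeatedSlices(const std::vector<float>& vol,
                                       std::size_t W, std::size_t H,
                                       std::size_t Z) {
    const std::size_t slice = W * H;  // elements per z-slice
    assert(vol.size() >= slice * Z);
    std::size_t repeats = 0;
    for (std::size_t z = 1; z < Z; ++z) {
        bool same = true;
        for (std::size_t i = 0; i < slice && same; ++i) {
            same = (vol[z * slice + i] == vol[(z - 1) * slice + i]);
        }
        if (same) ++repeats;
    }
    return repeats;
}
```

Running this on d_out before the copy and on the data read back from the array after the copy would indicate whether the repetition is already produced by the kernels or only shows up after cudaMemcpy3D.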
