cudaMemcpy3D behaviour

ojaswa · October 11, 2008, 8:11pm

Hi everyone,

I’m wondering how cudaMemcpy3D would behave if asked to copy a volume of size W1xH1xD1 from the host (source volume size WxHxD, with W>W1, H>H1, D>D1) to the device (destination volume size W1xH1xD1), starting at a source offset of (Wo, Ho, Do) such that part of the volume to be copied lies outside the source.

Will cudaMemcpy3D detect this and copy only the relevant portion?
Will the part of the destination volume outside the copied portion be set to zero or left unchanged?

Or is it the developer’s responsibility to make sure that the subvolume to be copied lies completely inside the source volume?

Many thanks,
Ojaswa

ojaswa · October 11, 2008, 9:11pm

My observation:

cudaMemcpy3D does not warn or complain in such a case, and the values that lie inside the source volume are copied correctly.
The values that fall outside the source volume are not set to anything in particular; they are perhaps read from whatever memory lies beyond the bounds, which could be anything.

The question now is: is it safe to do such copies (in terms of system stability, etc.)?

I ran into such copies because I need to manage copies from a large volume with a subvolume size that needs to be changed later (to reduce the application’s memory requirement).

-Ojaswa

paulius · October 15, 2008, 12:33am

It’s the programmer’s responsibility to avoid out-of-bounds accesses, just like it is in C. The only time you get bounds checking in CUDA is when using texture fetch functions.

Paulius
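
Since the caller has to keep the copy in bounds, one option is to clip the requested subvolume against the source before filling in the copy parameters. The following is a minimal host-side sketch in plain C, not something posted in this thread: `Vec3`, `clamp_axis` and `clamp_subvolume` are hypothetical names, all sizes are in elements, and no CUDA call is made.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper type (not part of the CUDA API); units are elements. */
typedef struct { size_t x, y, z; } Vec3;

/* Number of elements that can be copied along one axis without leaving
   a source of length `size`, starting at `offset` and asking for `want`. */
size_t clamp_axis(size_t offset, size_t want, size_t size)
{
    if (offset >= size)
        return 0;                      /* the offset is already outside */
    size_t room = size - offset;
    size_t n = want < room ? want : room;
    assert(offset + n <= size);        /* the clipped range stays inside */
    return n;
}

/* Part of the requested subvolume that lies inside the source volume. */
Vec3 clamp_subvolume(Vec3 src_size, Vec3 offset, Vec3 want)
{
    Vec3 out;
    out.x = clamp_axis(offset.x, want.x, src_size.x);
    out.y = clamp_axis(offset.y, want.y, src_size.y);
    out.z = clamp_axis(offset.z, want.z, src_size.z);
    return out;
}
```

For example, asking for a 4x4x4 block at offset (6, 0, 7) of an 8x8x8 source yields a clipped size of 2x4x1. The clipped size would then go into the copy extent (with the width multiplied by the element size when copying linear, non-array memory), and the rest of the destination would have to be cleared separately if zeros are wanted there.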

sudoca · March 19, 2009, 9:13am

Still on the subject of cudaMemcpy3D: I wish to copy memory from device to host. The device memory is allocated with cudaMalloc3D, so it might be padded for alignment, but the host memory is allocated as 1D data to keep further processing simple.

The copy fails as soon as the device memory is padded… Is there a solution?

ojaswa · March 19, 2009, 1:38pm

Can you paste your copying code? The padding is handled by CUDA, so that shouldn’t be much of a big deal. Just make sure that you create a cudaPitchedPtr from the host memory pointer like this:

copyParams.dstPtr = make_cudaPitchedPtr((void*)h_target, dst_extent.width*sizeof(float), dst_extent.width, dst_extent.height);

Good luck!

-Oj
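
To make the role of the two pitches concrete, here is a host-only sketch in plain C of what a pitched 3D copy amounts to conceptually: each row is copied separately, with the source and destination each advancing by their own pitch. It is an illustration under stated assumptions, not the actual cudaMemcpy3D implementation; `copy_pitched_volume` is a hypothetical name, and the copy is assumed to cover the full height of both allocations so that one slice is `pitch * height` bytes.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Copy a volume of width_bytes x height x depth between two pitched
   layouts. A tightly packed buffer simply has pitch == width_bytes. */
void copy_pitched_volume(void *dst, size_t dst_pitch,
                         const void *src, size_t src_pitch,
                         size_t width_bytes, size_t height, size_t depth)
{
    /* A row can never be wider than the pitch of either buffer. */
    assert(dst_pitch >= width_bytes && src_pitch >= width_bytes);

    unsigned char *d = (unsigned char *)dst;
    const unsigned char *s = (const unsigned char *)src;

    for (size_t z = 0; z < depth; ++z) {
        for (size_t y = 0; y < height; ++y) {
            size_t row = z * height + y;           /* row index in the volume */
            memcpy(d + row * dst_pitch,            /* row start, destination  */
                   s + row * src_pitch,            /* row start, source       */
                   width_bytes);                   /* payload only, no padding */
        }
    }
}
```

With a padded source (say 3 floats per row stored at a pitch of 16 bytes) and a packed destination (pitch of 12 bytes), the padding bytes are skipped and the destination ends up as a plain contiguous 1D array.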

sudoca · March 20, 2009, 8:31am

I checked the memory copy and it actually seems to work properly. The problem comes from the addressing of the elements in the allocated memory (see post: http://forums.nvidia.com/index.php?showtopic=87178&hl=). I still haven’t found a solution, and using standard 1D allocation doubles the computation time…

Here is the code for the memcpy, though, which works OK (it’s always nice to find code samples…):

cudaPitchedPtr ptrdevVolFloat;
// The extent width is given in bytes for linear (non-array) memory.
cudaExtent VolExtent = make_cudaExtent(plan->VolumeInfo->SizeX*sizeof(float), plan->VolumeInfo->SizeY, plan->VolumeInfo->SizeZ);
CUDA_SAFE_CALL(cudaMalloc3D(&ptrdevVolFloat, VolExtent));

(...)

// Wrap the tightly packed 1D host buffer: the pitch is exactly one row of floats, with no padding.
cudaPitchedPtr flVolDataPitchedPtr = make_cudaPitchedPtr((void*)ReconstructedSubVol, plan->VolumeInfo->SizeX*sizeof(float), plan->VolumeInfo->SizeX, plan->VolumeInfo->SizeY);

cudaMemcpy3DParms MemcpyParams = {0};
MemcpyParams.dstPtr = flVolDataPitchedPtr;
MemcpyParams.srcPtr = ptrdevVolFloat;
MemcpyParams.extent = VolExtent;
MemcpyParams.kind = cudaMemcpyDeviceToHost;
CUDA_SAFE_CALL(cudaMemcpy3D(&MemcpyParams));
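
On the element-addressing problem mentioned in this post: in padded memory, rows are `pitch` bytes apart rather than `SizeX*sizeof(float)` bytes, so the index arithmetic has to go through the pitch. The sketch below shows that arithmetic on the host in plain C. `pitched_elem` is a hypothetical helper, and it assumes float elements and a pitch that is a multiple of `sizeof(float)`; inside a kernel the same arithmetic would presumably use the `ptr` and `pitch` fields of the cudaPitchedPtr returned by cudaMalloc3D.

```c
#include <assert.h>
#include <stddef.h>

/* Address of element (x, y, z) in a pitched volume of floats.
   pitch is the row stride in bytes; ysize is the number of rows per slice. */
float *pitched_elem(void *base, size_t pitch, size_t ysize,
                    size_t x, size_t y, size_t z)
{
    /* x must address payload inside the row, which is at most pitch bytes. */
    assert((x + 1) * sizeof(float) <= pitch);

    unsigned char *slice = (unsigned char *)base + z * pitch * ysize; /* slice z */
    unsigned char *row   = slice + y * pitch;                         /* row y   */
    return (float *)row + x;                                          /* column x */
}
```

Indexing the same memory as `base[(z*SizeY + y)*SizeX + x]` is only correct when the pitch happens to equal `SizeX*sizeof(float)`, which would explain results that break as soon as the allocation is padded.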
