help with cudaMemcpy2D I can't get a matrix/ array to copy correctly from host to device

supotuco · July 9, 2009, 2:26pm

I am trying to copy an array from the host into 2D device memory. The currently the data is copied but the padding is wrong. I tried reading the reference manual and I think I passed the correct parameters. They’re both square matrices(dimension x == dimension y) Here is what I have.

cudaMemcpy2D(d_mat2,pitch2,mat2,memWidth,memWidth,dim

										 ,cudaMemcpyHostToDevice);

checkCUDAError("Memcpy 2D");

d_mat2 is the matrix on the device here is the declaration

cudaMallocPitch((void **)&d_mat2,&pitch2,memWidth,dim);

pitch2 is the pitch I got when using cudaMalloc2D

mat2 is the matrix to be copied (allocated as a dynamic one dimensional array type double)

memWidth is the size of double times dim (the dimension)

dim is the dimension

Quoc_Vinh · July 10, 2009, 9:58am

I quickly read your code, Nothing wrong at all. but you may consider about the dim, memWidth and the size of matrix allocated in host.
Does you card support double precision?

int memWidth = sizeof(double) * dim;
cudaMallocPitch((void **)&d_mat2, &pitch2, memWidth, dim);
…
cudaMemcpy2D(d_mat2, pitch2, mat2, memWidth, memWidth, dim, cudaMemcpyHostToDevice);

supotuco · July 13, 2009, 12:13pm

UPDATE: I fixed it. In cudaMallocPitch the returned pitch is for bytes. So when addressing you should go for the byte address or you can divide the pitch by the size of dataType and when doing mem-copies you multiply the pitch by size of dataType and everything will align correctly.

my card does support double precision so I don’t know why I still get the error

Quoc_Vinh · July 14, 2009, 3:57am

To tell nvcc compiler supports double precision arithmetic you must set “-arch sm_13” in the compile-command line option, default, nvcc compile with single-precision arithmetic.

Topic		Replies	Views
trouble with cudaMemcpy2D I cant get a matrix to copy into 2D pitched memory CUDA Programming and Performance	1	926	July 13, 2009
Can't get copyDeviceToHost to work with cudaMemcpy2D CUDA Programming and Performance	0	3633	November 13, 2009
problem with cudaMallocPitch and cudaMemcpy2D CUDA Programming and Performance	5	6375	April 22, 2009
Copying 2D array from host to device CUDA Programming and Performance	7	7294	July 27, 2010
Using cudaMemcpy2D very strange CUDA Programming and Performance	2	1373	March 10, 2009
cudaMemcpy2D / Grid size / MxN double matrix Problem copying a MxN double matrix from Host to Device CUDA Programming and Performance	3	1328	March 18, 2010
cudaMemcpy2D error CUDA Programming and Performance	1	1131	November 11, 2009
Memcpy2D error? CUDA Programming and Performance	2	2253	July 23, 2007
2D array & Memory space Mostly about cudaMallocPitch & cudaMemcpy2D CUDA Programming and Performance	1	1494	October 15, 2009
cudaMemcpy2D help CUDA Programming and Performance	4	10609	July 28, 2009

help with cudaMemcpy2D I can't get a matrix/ array to copy correctly from host to device

Related topics