Avoiding cudaMemcpy2D() because of 65536 pitch limit

DoZo1971 · April 13, 2009, 8:07pm

Hi,

I am having trouble with the 65536 pitch limit of the cudaMemcpy2D() function. I allocate a matrix that is very wide (>1000000) but not very high (<10) with cudaMallocPitch. This allocation gives no errors. Then (after the kernels finish) i would like to copy only the first row of the matrix back to host memory. Is there any trick to do this without cudaMemcpy2D()?

Kind regards,

Daniel Dekkers

DoZo1971 · April 14, 2009, 10:44am

… continued …

It seems i can simply use cudaMemcpy() to copy the first row (without padding bytes) from the device back to the host. It works for arbitrary matrix row widths (>65536). Why does this 65536 float pitch boundary in cudaMemcpy2D() exist anyway?

Kind regards,
Daniel Dekkers

Topic		Replies	Views
Why cudaMemcpy2D cause "invalid pitch argument"? CUDA Programming and Performance	2	6696	June 10, 2008
Can't get cuMemcpy2D to Work CUDA Programming and Performance	1	2065	October 7, 2010
cudaMemcpy2D(): reason for pitch limits? CUDA Programming and Performance	3	7190	August 20, 2007
cudaMemcpy2D invalid pitch argument CUDA Programming and Performance opencv , cuda	3	901	June 2, 2022
problem with cudaMallocPitch and cudaMemcpy2D CUDA Programming and Performance	5	6418	April 22, 2009
Using cudaMemcpy2D very strange CUDA Programming and Performance	2	1406	March 10, 2009
trouble with cudaMemcpy2D I cant get a matrix to copy into 2D pitched memory CUDA Programming and Performance	1	958	July 13, 2009
question on copy a matrix, which copy function to use CUDA Programming and Performance	1	668	April 18, 2016
need help for cudaMemcpy2D() CUDA Programming and Performance	5	4644	December 8, 2009
Problem with 2D memory copy using pitch CUDA Programming and Performance	6	6588	November 20, 2011

Avoiding cudaMemcpy2D() because of 65536 pitch limit

Related topics