Need some help with cudaMemcpy/cudaMemcpy2D

xargon · June 8, 2010, 6:34pm

Hello,

I have been struggling for a while with this. I have a C++ array that is allocated dynamically as follows:

[codebox]
float ** A = new float *[equations];
for (unsigned k = 0; k < equations; ++k)
{
    A[k] = new float[parameters];
}
[/codebox]

Now, what I want to do is transfer this to the device. However, I have been unsuccessful in doing so:

I tried both cudaMemcpy and cudaMemcpy2D. With cudaMemcpy2D, I tried the following:

[codebox]
float *d_A = 0;
cudaMalloc((void**)d_A, equations*parameters*size);
cudaMemcpy2D(d_A, equations * sizeof(float), A, equations * sizeof(float),
             equations, parameters, cudaMemcpyHostToDevice);
[/codebox]

However, when I examine the copied values, they are rubbish.

Does anyone know what I should do to achieve this?

Thanks,

Luca

tera · June 9, 2010, 12:07am

An array of pointers is not the same thing as a two-dimensional array. Since you allocate each of the 1D stripes separately on the host, you will have to do the same on the device:

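[codebox]
// Host-side table holding one device pointer per stripe.
float **d_A = new float *[equations];
for (unsigned k = 0; k < equations; ++k)
{
    // Each stripe holds `parameters` floats, so allocate and copy that many.
    cudaMalloc((void**)&d_A[k], parameters * sizeof(float));
    cudaMemcpy(d_A[k], A[k], parameters * sizeof(float), cudaMemcpyHostToDevice);
}
[/codebox]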

It might be worth turning this into one big allocation of an array with equations*parameters floats, so that you can alloc (and later copy) them in one go, instead of per-stripe.
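For illustration, here is a minimal sketch of that single-allocation approach. The helper names `flatten` and `upload` are made up for this example and are not part of CUDA. It assumes row-major packing, so element (k, j) ends up at index k * parameters + j. The CUDA part is wrapped in `#ifdef __CUDACC__` so that the host-side packing also builds with a plain C++ compiler.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>
#ifdef __CUDACC__
#include <cuda_runtime.h>
#endif

// Pack a row-pointer matrix (rows x cols) into one contiguous, row-major
// buffer. Element (r, c) lands at index r * cols + c.
std::vector<float> flatten(float* const* A, std::size_t rows, std::size_t cols)
{
    assert(A != nullptr || rows == 0);
    std::vector<float> flat(rows * cols);
    for (std::size_t r = 0; r < rows; ++r)
        for (std::size_t c = 0; c < cols; ++c)
            flat[r * cols + c] = A[r][c];
    return flat;
}

#ifdef __CUDACC__
// Upload the packed buffer with one allocation and one copy.
// Returns a device pointer (release it with cudaFree), or nullptr on failure.
float* upload(const std::vector<float>& flat)
{
    float* d_A = nullptr;
    const std::size_t bytes = flat.size() * sizeof(float);
    if (cudaMalloc((void**)&d_A, bytes) != cudaSuccess)
        return nullptr;
    if (cudaMemcpy(d_A, flat.data(), bytes, cudaMemcpyHostToDevice) != cudaSuccess) {
        cudaFree(d_A);
        return nullptr;
    }
    return d_A;
}
#endif
```

With the host array from the first post, this would be called roughly as `float *d_A = upload(flatten(A, equations, parameters));`, with a matching `cudaFree(d_A)` once the data is no longer needed.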

And it’s definitely worth adding error checking.
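One common way to do that is to wrap every runtime call in a checking macro. The sketch below is one possible version, not an official API: `CUDA_CHECK` and `describeFailure` are names invented for this example, while `cudaError_t`, `cudaSuccess` and `cudaGetErrorString` come from the CUDA runtime. The macro is only defined when compiling with nvcc.

```cpp
#include <cstdio>
#include <cstdlib>
#include <string>
#ifdef __CUDACC__
#include <cuda_runtime.h>
#endif

// Format "<file>:<line>: <call> failed: <reason>" for a failed call.
std::string describeFailure(const char* call, const char* reason,
                            const char* file, int line)
{
    return std::string(file) + ":" + std::to_string(line) + ": " +
           call + " failed: " + reason;
}

#ifdef __CUDACC__
// Evaluate a CUDA runtime call once; on any error, report it and exit.
#define CUDA_CHECK(call)                                                   \
    do {                                                                   \
        cudaError_t err_ = (call);                                         \
        if (err_ != cudaSuccess) {                                         \
            std::fprintf(stderr, "%s\n",                                   \
                         describeFailure(#call, cudaGetErrorString(err_),  \
                                         __FILE__, __LINE__).c_str());     \
            std::exit(EXIT_FAILURE);                                       \
        }                                                                  \
    } while (0)
#endif
```

Each call is then written as, for example, `CUDA_CHECK(cudaMalloc((void**)&d_A, bytes));`, so a failed allocation or copy is reported at the line where it happened instead of showing up later as rubbish values.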

xargon · June 9, 2010, 10:52pm

Thanks. I ended up refactoring my code to use a single linear array.
