cudaMemcpy question

atoo · February 25, 2010, 8:21am

I am trying to learn Cuda. I have the following code,

global void DivideByPivot(float *d_m, float pivot, int N)
{
int j = blockIdx.x * blockDim.x + threadIdx.x;
if (j < N) d_m[j] /= pivot;
}

I want to read back the value in d_m[7] from the device. Do I have to use cudaMemcpy to do that?
How can I then assign d_m[7] = 1.0 Do I have to do cudaMemcpy again?

Is there a simpler way to read and write a single value from and to the device memory?

Thanks…

CUDA_VST_RT · February 25, 2010, 10:07am

I am trying to learn Cuda. I have the following code,

global void DivideByPivot(float *d_m, float pivot, int N)

{
    int j = blockIdx.x * blockDim.x + threadIdx.x;

    if (j < N) d_m[j] /= pivot;
}

I want to read back the value in d_m[7] from the device. Do I have to use cudaMemcpy to do that?

How can I then assign d_m[7] = 1.0 Do I have to do cudaMemcpy again?

Is there a simpler way to read and write a single value from and to the device memory?

Thanks…

Here is what you do:

Host Application does the CudaMemcpy (HosttoDevice)

then you can access the Data within your Kernel by using the index dm[index]

to get the data back to the host the host code does:

CudaMemcpy DevicetoHost

You only need to copy your data once, then use it in the kernel and then copy it back.

i suggest you look into some example codes.

atoo · February 25, 2010, 5:03pm

Hi,

I don’t need to read back the whole array. I need to read back only a single value from the device like x = d_m[7].
I also want to write to a single position on the device like d_m[7] = x; Do I need to call cudaMemcpy for these?

CUDA_VST_RT · February 25, 2010, 5:13pm

mh ok maybe i need more informations.

you can also just readback a single item.

cudaMemcpy(a_device , a_Host , MEMsize, cudaMemcpyDeviceToHost);

this would copy the a_device Array from gpumemory to hostmemory. the size is defined by for example MEMSize = 10 * sizeof(float)
for an float array with 10 elements.

but i dont really get it what you want to do, and why!

You only need cudaMemcpy when transferring data from Device to Host or vice versa.

once your data is on the device you can acces and edit it. just by indexing!

atoo · February 25, 2010, 6:49pm

//*** Host **********************************************
pivot = d_Gs[ind + icol];
d_Gs[ind + icol] = 1.0;
DivideByPivot <<< Blocks, BlockSize >>> ((d_Gs + ind), pivot, NP);
if (d_Gs[ind + icol] == 1)
DivideByPivot <<< Blocks, BlockSize >>> ((d_Gs + ind), (pivot * 0.5) , NP);

/********************************************************

global void DivideByPivot(float *d_m, float pivot, int N)
{
int j = blockIdx.x * blockDim.x + threadIdx.x;
if (j < N) d_m[j] /= pivot;
}

d_Gs is a device memory area. I want to read a single value to a variable called pivot. I also want to write a value to a single device location d_Gs[ind + icol] = 1.0;

CUDA_VST_RT · February 26, 2010, 2:56pm

You cant access any device memory from the host, without cudamemcpyDeevicetohost.

d_Gs[ind + icol] = 1.0; doesnt work. you must first copy d_Gs back to the host.

Topic		Replies	Views
How to copy a single variable from device to host? CUDA Programming and Performance	2	7259	March 26, 2010
The most basic problem,ask for help CUDA Programming and Performance	5	2087	February 2, 2009
__constant__ and __device__ memory access CUDA Programming and Performance	4	5864	April 10, 2012
''cudaMemcpy'' failed to copy from device memory dynamically allocate using ''malloc'' CUDA Programming and Performance	5	458	October 25, 2022
copy device memory to constant memory CUDA Programming and Performance	4	12465	November 11, 2008
cudaMemcpy does not copy data from the host to device CUDA Programming and Performance	6	6968	June 20, 2012
cudaMemcpyFromSymbol painful problem CUDA Programming and Performance	6	3350	December 21, 2009
How to use cudaMemcpyFromSymbol with global device variable? CUDA Programming and Performance	1	1022	December 8, 2013
cudaMallocHost How to use CUDA Programming and Performance	6	35080	April 26, 2012
cudaMemcpy to device allocated memory (via malloc) fails with CUDA Programming and Performance	1	555	June 25, 2021

cudaMemcpy question

Related topics