Problem with cudaHostAlloc Problem with Memcpy

Elrachal · July 2, 2012, 2:00pm

Hi there,

I have a small problem using cudaHostAlloc.

Here is my code:

    __device__ int addDevice( int a, int b ) {
        return a + b;
    }

    __global__ void add( int a, int b, int *c ) {
        *c = addDevice( a, b );
    }

    int main( void ) {
        int c;
        int *dev_c;

        HANDLE_ERROR( cudaHostAlloc( (void**)&dev_c, sizeof(int), cudaHostAllocDefault ) );

        add<<<1,1>>>( 1, 9, dev_c );

        HANDLE_ERROR( cudaMemcpy( &c, dev_c, sizeof(int),
                                  cudaMemcpyDeviceToHost ) );

        printf( "1 + 9 = %d\n", c );

        HANDLE_ERROR( cudaFreeHost( dev_c ) );

        return 0;
    }

The problem seems to come from the cudaMemcpy call, which fails with "invalid argument".

Does anyone have an idea where the problem comes from?

Cheers

Elrachal

RezaRob3 · July 2, 2012, 3:20pm

dev_c is already on the host, so DeviceToHost doesn’t seem to make sense.

Elrachal · July 2, 2012, 3:30pm

But dev_c represents the value of c in the kernel.

By writing cudaMemcpyDeviceToHost, I am updating the value of dev_c with the value of c I obtained in the kernel. Or am I wrong?

wanderine · July 2, 2012, 9:08pm

You have allocated memory on the host instead of on the device?
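For reference, here is a minimal sketch of the variant with the buffer allocated on the device, assuming the goal is simply to get the kernel result back into a host variable. CHECK is a hypothetical stand-in for the HANDLE_ERROR macro from the question (its definition is not shown in this thread), and add_on_device is a name made up for illustration.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical stand-in for the HANDLE_ERROR macro used in the question.
#define CHECK(call)                                                    \
    do {                                                               \
        cudaError_t err_ = (call);                                     \
        if (err_ != cudaSuccess) {                                     \
            fprintf(stderr, "%s at %s:%d\n", cudaGetErrorString(err_), \
                    __FILE__, __LINE__);                               \
            exit(EXIT_FAILURE);                                        \
        }                                                              \
    } while (0)

__global__ void add(int a, int b, int *c) {
    *c = a + b;
}

// Illustrative helper: runs the kernel and copies the result back to the host.
int add_on_device(int a, int b) {
    int c = 0;
    int *dev_c = nullptr;

    // Device memory, so a DeviceToHost copy is the matching direction.
    CHECK(cudaMalloc((void **)&dev_c, sizeof(int)));

    add<<<1, 1>>>(a, b, dev_c);
    CHECK(cudaGetLastError());  // catches launch errors

    // cudaMemcpy on the default stream waits for the kernel to finish.
    CHECK(cudaMemcpy(&c, dev_c, sizeof(int), cudaMemcpyDeviceToHost));

    CHECK(cudaFree(dev_c));
    return c;
}

int main() {
    printf("1 + 9 = %d\n", add_on_device(1, 9));
    return 0;
}
```

The only changes relative to the posted code are cudaMalloc/cudaFree in place of cudaHostAlloc/cudaFreeHost, so that dev_c really is a device pointer.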

cbuchner1 · July 2, 2012, 10:20pm

But cudaHostAlloc is supposed to return a device pointer to memory that is pinned at the host side. So essentially any cudaMemcpy’s execute faster.

I can’t spot any mistake in the original poster’s code.

tera · July 2, 2012, 11:07pm

cudaHostAlloc() returns a host pointer, even if the cudaHostAllocMapped flag were specified (which isn’t in the example above). You still need to call cudaHostGetDevicePointer() to obtain the corresponding device pointer for mapped memory. Only under certain conditions (UVA) will these pointers be the same.
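A rough sketch of the mapped-memory route described above, assuming the device supports mapping host memory. CHECK and add_with_mapped_memory are hypothetical names introduced for illustration, not part of the original code. The cudaSetDeviceFlags call is what older toolkits required before the context was created; as far as I know, recent runtimes enable host mapping implicitly.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical stand-in for the HANDLE_ERROR macro used in the question.
#define CHECK(call)                                                    \
    do {                                                               \
        cudaError_t err_ = (call);                                     \
        if (err_ != cudaSuccess) {                                     \
            fprintf(stderr, "%s at %s:%d\n", cudaGetErrorString(err_), \
                    __FILE__, __LINE__);                               \
            exit(EXIT_FAILURE);                                        \
        }                                                              \
    } while (0)

__global__ void add(int a, int b, int *c) {
    *c = a + b;
}

// Illustrative helper: the kernel writes straight into pinned, mapped host memory.
int add_with_mapped_memory(int a, int b) {
    int *host_c = nullptr;  // host pointer returned by cudaHostAlloc
    int *dev_c = nullptr;   // device-side alias of the same memory

    CHECK(cudaHostAlloc((void **)&host_c, sizeof(int), cudaHostAllocMapped));

    // Ask the runtime for the device pointer instead of passing host_c directly.
    CHECK(cudaHostGetDevicePointer((void **)&dev_c, host_c, 0));

    add<<<1, 1>>>(a, b, dev_c);
    CHECK(cudaGetLastError());

    // No cudaMemcpy here, so wait explicitly before reading on the host.
    CHECK(cudaDeviceSynchronize());

    int c = *host_c;
    CHECK(cudaFreeHost(host_c));
    return c;
}

int main() {
    // Needed before context creation on older toolkits (assumption: harmless on newer ones).
    CHECK(cudaSetDeviceFlags(cudaDeviceMapHost));
    printf("1 + 9 = %d\n", add_with_mapped_memory(1, 9));
    return 0;
}
```

With this approach there is no cudaMemcpy at all: the result is read through the host pointer once the kernel has finished.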

cbuchner1 · July 2, 2012, 11:14pm

Ah, thanks for clarifying. It’s been a while since I last used this function.
