Cannot modify a value in Page locked memory

h4p0 · June 4, 2010, 10:36am

[Moving the thread to the right section: CUDA Programming and Development ]


I have a problem.

h_value remains 0 … why?

Could anyone help me please?

Code:

[codebox]

void my_function(void){
    int *h_value = NULL;
    int *d_value = NULL;
    dim3 block(1,1);
    dim3 grid(1,1);

    cudaSetDeviceFlags(cudaDeviceMapHost);
    cudaHostAlloc((void**) &h_value, sizeof( int ) , cudaHostAllocMapped | cudaHostAllocPortable);
    cudaHostGetDevicePointer( &d_value , h_value , 0 );

    //init value on host
    *h_value=0;

    my_kernel<<<grid,block>>>( d_value );

    printf("h_value = %d \n",*h_value);
}

[/codebox]

[codebox]__global__ void my_kernel( int *value )
{
    *value=1;
}

[/codebox]

Thanks.

Nico · June 4, 2010, 10:52am

Run the deviceQuery example to see whether your GPU supports mapping page-locked host memory.
See section 3.2.6.3 of the programming guide.

N.
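
The same capability that deviceQuery prints can also be checked from code. A minimal sketch, assuming device 0 is the one in use; the function name `can_map_host_memory` is made up for this illustration, while `cudaGetDeviceProperties` and the `canMapHostMemory` field come from the CUDA runtime API:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper (not from the thread).
// Returns 1 if the device can map page-locked host memory into its
// address space, 0 if it cannot, and -1 if the query itself failed
// (for example, when no CUDA device is present).
int can_map_host_memory(int device)
{
    cudaDeviceProp prop;
    cudaError_t err = cudaGetDeviceProperties(&prop, device);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaGetDeviceProperties: %s\n", cudaGetErrorString(err));
        return -1;
    }
    return prop.canMapHostMemory ? 1 : 0;
}
```

If `can_map_host_memory(0)` returns 0, `cudaHostAllocMapped` allocations cannot be used from kernels on that device and an explicit `cudaMemcpy` is needed instead.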

h4p0 · June 4, 2010, 11:13am

Many thanks.

I added a cudaThreadSynchronize(); after the kernel launch and it works!
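
For reference, a sketch of the working pattern: a kernel launch returns immediately, so the host has to wait for the kernel before reading mapped memory. The names `set_to_one` and `mapped_write_demo` are made up for this illustration. It uses `cudaDeviceSynchronize()`, which newer toolkits provide in place of the deprecated `cudaThreadSynchronize()`; on a 2010-era toolkit the older call would be used instead.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel name: writes 1 through the mapped pointer.
__global__ void set_to_one(int *value)
{
    *value = 1;
}

// Hypothetical helper: returns the value the host sees after the kernel
// has finished, or -1 if any CUDA call fails.
int mapped_write_demo(void)
{
    int *h_value = NULL;
    int *d_value = NULL;

    // Request mapped pinned allocations before any other CUDA work.
    if (cudaSetDeviceFlags(cudaDeviceMapHost) != cudaSuccess) return -1;
    if (cudaHostAlloc((void **)&h_value, sizeof(int), cudaHostAllocMapped) != cudaSuccess) return -1;
    if (cudaHostGetDevicePointer((void **)&d_value, h_value, 0) != cudaSuccess) {
        cudaFreeHost(h_value);
        return -1;
    }

    *h_value = 0;                    // init value on host
    set_to_one<<<1, 1>>>(d_value);   // asynchronous: returns before the kernel runs

    // Wait for the kernel; only then is the device write visible to the host.
    cudaError_t err = cudaDeviceSynchronize();
    int result = (err == cudaSuccess) ? *h_value : -1;

    cudaFreeHost(h_value);
    return result;
}
```

Without the synchronization call, the host read races with the kernel and will usually still see 0.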

h4p0 · June 4, 2010, 5:59pm

As Nico said, it was a synchronization problem: inserting a cudaThreadSynchronize(); after the kernel launch lets the host read the correct value.

But now I have a new problem:

[codebox]

void my_function(void){
    int *h_value = NULL;
    int *d_value = NULL;
    char *d_input_data = NULL;
    int size=1024;
    dim3 block(1,1);
    dim3 grid(1,1);

    cudaMalloc((void**)&d_input_data, size); //< This line makes kernel increment fail!

    cudaSetDeviceFlags(cudaDeviceMapHost);
    cudaHostAlloc((void**) &h_value, sizeof( int ) , cudaHostAllocMapped | cudaHostAllocPortable);
    cudaHostGetDevicePointer( &d_value , h_value , 0 );

    //init value on host
    *h_value=0;

    my_kernel<<<grid,block>>>( d_input_data , d_value );

    printf("h_value = %d \n",*h_value);
}

[/codebox]

[codebox]__global__ void my_kernel( char* device_array, int *value )
{
    *value=1;
}

[/codebox]

Simply by adding a cudaMalloc() and one more kernel parameter, the mapped variable can no longer be modified (!!!) and h_value remains 0.

Any hints?
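
None of the calls above check their return codes, so a failing call would go unnoticed. A sketch for narrowing this down replays the same call order and prints any error; the names `report` and `count_failures_in_post_order` are made up for this illustration. One thing worth verifying with it: the runtime documentation of that era says cudaSetDeviceFlags() returns cudaErrorSetOnActiveProcess once the runtime has already been initialized, which an earlier cudaMalloc() would do. Whether that is what happens here is an assumption until the output confirms it, and newer toolkits may behave differently.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: prints the failing call and returns 1 on error, 0 on success.
static int report(const char *what, cudaError_t err)
{
    if (err != cudaSuccess) {
        fprintf(stderr, "%s failed: %s\n", what, cudaGetErrorString(err));
        return 1;
    }
    return 0;
}

// Replays the call order from the post (cudaMalloc first, then
// cudaSetDeviceFlags) and returns how many of the calls reported an error.
int count_failures_in_post_order(void)
{
    char *d_input_data = NULL;
    int *h_value = NULL;
    int *d_value = NULL;
    int failures = 0;

    failures += report("cudaMalloc",
                       cudaMalloc((void **)&d_input_data, 1024));
    failures += report("cudaSetDeviceFlags",
                       cudaSetDeviceFlags(cudaDeviceMapHost));

    cudaError_t alloc_err = cudaHostAlloc((void **)&h_value, sizeof(int),
                                          cudaHostAllocMapped | cudaHostAllocPortable);
    failures += report("cudaHostAlloc", alloc_err);

    // Only query the device pointer if the host allocation succeeded.
    if (alloc_err == cudaSuccess) {
        failures += report("cudaHostGetDevicePointer",
                           cudaHostGetDevicePointer((void **)&d_value, h_value, 0));
        cudaFreeHost(h_value);
    }

    cudaFree(d_input_data);
    return failures;
}
```

If the output names cudaSetDeviceFlags, moving that call to the very top of my_function(), before cudaMalloc(), would be the first thing to try.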
