Error in using CUDA mapped memory: a test program for mapped memory

fly127 · April 20, 2010, 12:42pm

Hi, guys,

I wrote a test program that uses mapped memory. However, it failed and I don’t know why.

Here is the source code:

#include "cuda_runtime.h"
#include <stdlib.h>
#include <stdio.h>

int main(void) {
	unsigned int *h_array, *d_array, *h_array_test;
	int cudaError;
	int num = 5;
	int size = num * sizeof(unsigned int);

	cudaSetDevice(0);

	// set device flag
	cudaSetDeviceFlags( cudaDeviceMapHost );

	// allocate pinned memory
	cudaError=cudaHostAlloc( (void**) &(h_array), size, cudaHostAllocMapped );
	if (cudaError)
		printf ("Failed to allocate pinned memory \n");

	// get device ptr
	cudaError=cudaHostGetDevicePointer( (void**) &(d_array), h_array, 0 );
	if (cudaError)
		printf ("Failed to get device pointer \n");

	// initialize test data
	for (int i = 0; i < num; i ++)
	{
		h_array[i] = i;
	}

	// output array in host memory
	printf("array in host memory:\n");
	for (int i = 0; i < num; i ++)
	{
		printf("%d: %d\n", i, h_array[i]);
	}
	printf("\n");

	// output array in device memory
	//cudaMemcpy(d_array, h_array, size, cudaMemcpyHostToDevice);
	h_array_test = (unsigned int*)malloc(size);
	cudaError=cudaMemcpy( h_array_test, d_array, size, cudaMemcpyDeviceToHost );
	if (cudaError)
		printf ("Failed to copy device memory \n");

	printf("array in device memory:\n");
	for (int i = 0; i < num; i ++)
	{
		printf("%d: %d\n", i, h_array_test[i]);
	}
	printf("\n");
}

And the result is as follows:

array in host memory:
0: 0
1: 1
2: 2
3: 3
4: 4

Failed to copy device memory
array in device memory:
0: -1163005939
1: -1163005939
2: -1163005939
3: -1163005939
4: -1163005939

Apparently, the mapping was not successful. Can anyone help me out? Many thanks!

BTW: my GPU is a Quadro FX 3800 with compute capability 1.3, which supports page-locked memory mapping.
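
Whether a given device can map host memory can also be checked at run time, and the generic "Failed to ..." messages in the program above can be replaced by the runtime's own error text. The following is only a minimal sketch of that idea, not part of the original program: `reportError` is a hypothetical helper name, and device index 0 is assumed because that is what the post passes to `cudaSetDevice`. `cudaGetDeviceProperties`, the `canMapHostMemory` field and `cudaGetErrorString` are standard CUDA runtime API.

```cuda
#include <cuda_runtime.h>
#include <stdio.h>

// Hypothetical helper (not from the original post): prints the runtime's
// description of a failed call. Returns 1 on failure, 0 on success.
static int reportError(cudaError_t err, const char *what)
{
    if (err != cudaSuccess) {
        printf("%s failed: %s\n", what, cudaGetErrorString(err));
        return 1;
    }
    return 0;
}

int main(void)
{
    int dev = 0;  // assumption: same device index as in the post
    cudaDeviceProp prop;

    // Query the device; this fails if no usable CUDA device is present.
    if (reportError(cudaGetDeviceProperties(&prop, dev), "cudaGetDeviceProperties"))
        return 1;

    // canMapHostMemory is nonzero when the device can map page-locked
    // host memory into its own address space.
    printf("%s: compute capability %d.%d, canMapHostMemory = %d\n",
           prop.name, prop.major, prop.minor, prop.canMapHostMemory);
    return 0;
}
```

Wrapping the `cudaMemcpy` call from the program above in the same helper, for example `reportError(cudaMemcpy(...), "cudaMemcpy")`, would show which error the runtime actually returned instead of only the fact that the copy failed.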

Lev · April 20, 2010, 12:50pm

There is an example in the SDK that uses pinned memory. Does it work on your system?

fly127 · April 20, 2010, 1:10pm

Yes, I just tried simpleZeroCopy and it works! So there must be something wrong with my program. I still cannot find out what.

fly127 · April 20, 2010, 2:04pm

I just modified my code a little by adding a kernel call that uses the mapped memory. This time it works!

__global__ void modifyArray(unsigned int *arr, int N)
{
	int idx = blockIdx.x*blockDim.x + threadIdx.x;
	if (idx < N)
		arr[idx] = idx;
}

int main(void) {
	unsigned int *h_array, *d_array;
	int cudaError;
	int num = 5;
	int size = num * sizeof(unsigned int);

	cudaSetDevice(0);

	// set device flag
	cudaSetDeviceFlags( cudaDeviceMapHost );

	// allocate pinned memory
	cudaError=cudaHostAlloc( (void**) &(h_array), size, cudaHostAllocMapped );
	if (cudaError)
		printf ("Failed to allocate pinned memory \n");

	// get device ptr
	cudaError=cudaHostGetDevicePointer( (void**) &(d_array), (void*)h_array, 0 );
	if (cudaError)
		printf ("Failed to get device pointer \n");

	// initialize test data
	memset(h_array, 0, size);

	// output array in host memory
	printf("array in host memory:\n");
	for (int i = 0; i < num; i ++)
	{
		printf("%d: %d\n", i, h_array[i]);
	}
	printf("\n");

	// call kernel
	dim3 grid, block;
	grid.x = 1;
	block.x = num;
	modifyArray<<<grid,block>>>(d_array, num);
	cutilSafeCall(cudaThreadSynchronize());

	// output array in host memory after the kernel call
	printf("array in host memory after kernel call:\n");
	for (int i = 0; i < num; i ++)
	{
		printf("%d: %d\n", i, h_array[i]);
	}
	printf("\n");
}

The result is:

array in host memory:
0: 0
1: 0
2: 0
3: 0
4: 0

array in host memory after kernel call:
0: 0
1: 1
2: 2
3: 3
4: 4

Does this mean that only a kernel call initiates the automatic data transfer between the device and the host?
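
One way to probe this question further, offered as a sketch rather than an answer, is to test the opposite direction as well: let the host write the values, have a kernel read and modify them through the mapped pointer, and then read the array on the host again without any `cudaMemcpy`. The names `incrementArray` and `runMappedIncrement` are hypothetical, the sketch assumes `n` is small enough to fit in a single thread block, and it uses `cudaDeviceSynchronize`, which newer toolkits provide in place of `cudaThreadSynchronize`.

```cuda
#include <cuda_runtime.h>
#include <stdio.h>

// Reads each element through the mapped pointer and writes back value + 1.
__global__ void incrementArray(unsigned int *arr, int n)
{
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if (idx < n)
        arr[idx] += 1;
}

// Hypothetical helper: returns 0 on success, -1 if any CUDA call fails.
// On success, out[i] holds what the host sees after the kernel finished.
// Assumption: n fits in one thread block.
static int runMappedIncrement(int n, unsigned int *out)
{
    unsigned int *h_array = NULL, *d_array = NULL;
    size_t size = (size_t)n * sizeof(unsigned int);
    int rc = -1;

    if (cudaHostAlloc((void **)&h_array, size, cudaHostAllocMapped) != cudaSuccess)
        return -1;

    if (cudaHostGetDevicePointer((void **)&d_array, h_array, 0) == cudaSuccess) {
        for (int i = 0; i < n; i++)
            h_array[i] = i;  // written by the host only

        incrementArray<<<1, n>>>(d_array, n);

        // Check the launch, then wait for the kernel before the host reads.
        if (cudaGetLastError() == cudaSuccess &&
            cudaDeviceSynchronize() == cudaSuccess) {
            for (int i = 0; i < n; i++)
                out[i] = h_array[i];
            rc = 0;
        }
    }
    cudaFreeHost(h_array);
    return rc;
}

int main(void)
{
    const int num = 5;
    unsigned int result[num];

    // Request host-memory mapping before the runtime creates a context,
    // as the programs above do. The return value is ignored here because
    // any resulting failure surfaces in runMappedIncrement.
    cudaSetDeviceFlags(cudaDeviceMapHost);

    if (runMappedIncrement(num, result) != 0) {
        printf("mapped-memory test could not run on this system\n");
        return 1;
    }
    for (int i = 0; i < num; i++)
        printf("%d: %u\n", i, result[i]);
    return 0;
}
```

If the host then sees each element equal to its index plus one, the kernel must have read the values the host wrote and the host must have seen the kernel's writes, with no explicit copy in either direction.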
