What's wrong with this extremely simple program?

andrew732 · July 16, 2010, 7:25pm

I have a tiny program that moves some data into the global device memory, changes it slightly using one thread in one block, and then reads it back. Here is the code and the output, which shows that the data is only partially changed for some strange reason. What really dumb mistake am I making? If it makes a difference my machine is Red Hat Enterprise 5.4 with a Quadro FX 580, driver 256.35, Cuda 3.1. Thanks~

#include <stdio.h>

#include <cuda.h>

__global__ void kernel(float *x);

int main(void)

{

	int i, size;

	float *host, *device;

	host = (float *) malloc(4 * sizeof(float));

	for(i = 0; i < 4; ++i) { host[i] = 1.0; }

	printf("host before:\n");

	for(i = 0; i < 4; ++i) { printf("%f\n", host[i]); }

	size = sizeof(host);

	cudaMalloc(&device, size);

	cudaMemcpy(device, host, size, cudaMemcpyHostToDevice);

	kernel<<<1, 1>>>(device);

	cudaMemcpy(host, device, size, cudaMemcpyDeviceToHost);

	printf("host after:\n");

	for(i = 0; i < 4; ++i) { printf("%f\n", host[i]); }

	cudaFree(device);

	free(host);

	return 0;

}

__global__ void kernel(float *x)

{

	x[0] = 2.0;

	x[1] = 2.0;

	x[2] = 2.0;

	x[3] = 2.0;

}

Gives this output:

host before:

1.000000

1.000000

1.000000

1.000000

host after:

2.000000

2.000000

1.000000

1.000000

andrew732 · July 16, 2010, 7:34pm

I knew it would be something really dumb. I should be doing size = 4 * sizeof(float); instead of size = sizeof(host);
:">

tera · July 16, 2010, 7:36pm

Try this:
[font=“Courier New”] size = 4 * sizeof(*host);[/font]

EDIT: glad you found it… External Image

Topic		Replies	Views
My first program with CUDA need some help CUDA Programming and Performance	3	2563	August 10, 2009
copy object from host to device CUDA Programming and Performance	4	1861	August 18, 2010
Reordering a vector. kernel working only for single precision CUDA Programming and Performance	0	1313	July 30, 2011
Can't copy device memory to host memory CUDA Programming and Performance	2	3098	June 10, 2009
Newbie: Error while device to host memcopy CUDA Programming and Performance	2	1925	July 18, 2008
memory access error CUDA Programming and Performance	11	1395	January 12, 2013
kernel only executes successfully once, then cudaMemcpy segfaults CUDA Programming and Performance	2	3167	March 31, 2009
Newbie question about data transfer CUDA Programming and Performance	4	2701	July 25, 2008
cudaMemcpy question CUDA Programming and Performance	5	12155	February 26, 2010
CUDA and Quadro FX 580 memory usage/corruption CUDA Programming and Performance	0	710	July 27, 2014

What's wrong with this extremely simple program?

Related topics