Why it doesnt work ? Simple program that adds two vectors

BufferOverflow · March 17, 2010, 9:02pm

I wrote a simple program that should add two vectors, 1 character per 1 thread. But the value returned from kernel differs from i expected ((. Could someone explain why?

[codebox]#include <stdio.h>

#include <stdlib.h>

#include “cuda.h”

#define MAX 16 //size of arrays and number of threads

global void addVec(int *a, int *b, int *c);

int main()

{

int i;

int a_h[MAX];

int b_h[MAX];

int c_h[MAX];

int *a_d;

int *b_d;

int *c_d;

int size = MAX * sizeof(char);

//initialisation of an array

for(i = 0; i < MAX; i++){

a_h[i] = i;

b_h[i] = i;

}

cudaMalloc((void**)&a_d, size);

cudaMalloc((void**)&b_d, size);

cudaMalloc((void**)&c_d, size);

cudaMemcpy(a_d, a_h, size, cudaMemcpyHostToDevice);

cudaMemcpy(b_d, b_h, size, cudaMemcpyHostToDevice);

addVec<<<1, MAX>>>(a_d, b_d, c_d);

cudaMemcpy(c_h, c_d, size, cudaMemcpyDeviceToHost);

for(i = 0; i < MAX; i++)

printf("%d \n",*(c_h + i));

return 0;

}

global void addVec(int *a,int *b,int *c)

{

int i = threadIdx.x;



*(c + i) = *(a + i) + *(b + i);

}[/codebox]

right values are just 1-4 of resulted array. If i increase array size, number of right values increases too.

It depends from array size so that – right values = array size / 4.

This code is like example in CUDA Programming Guide.

avidday · March 17, 2010, 9:08pm

sizeof(char) isn’t the same as sizeof(int) - you are not allocating enough memory for the device, nor copying enough of the host data onto the device.

sinclair · March 17, 2010, 9:11pm

Also, I think you need to transfer the output array from the host to the device before you call your device. It might work anyways, but to be correct that’s what you’d want to do.

Matt

BufferOverflow · March 18, 2010, 8:08am

avidday, thanks a lot ! I don’t observed this mistake. I allocated incorrect amount of memory ((

Sarnath · March 18, 2010, 12:40pm

So, There was a buffer overflow…

_Big_Mac · March 18, 2010, 1:31pm

Not true. He has already allocated the output array - he doesn’t initialize it with data but there’s no need to.

sinclair · March 18, 2010, 5:54pm

I must have missed it then, my bad.

Matt

Topic		Replies	Views
help for my cuda code Teaching and Curriculum Support	2	3890	March 31, 2015
MyFirstCuda CUDA Programming and Performance	5	4197	February 11, 2010
The Cuda Programming Guide Samples Errors CUDA Programming and Performance	5	2153	August 26, 2009
Result of simple vector summation is not correct. CUDA Programming and Performance	2	779	July 23, 2013
cudaMemcpy Failing To Copy Variable From Device To Host Correctly CUDA Programming and Performance	3	2824	April 26, 2021
Problem with Vectors add Can't compute sum of two vectors CUDA Programming and Performance	4	1632	March 16, 2009
cudaMemcpy don't work CUDA Programming and Performance	4	1793	July 3, 2015
My first program with CUDA need some help CUDA Programming and Performance	3	2563	August 10, 2009
Why does my streaming vector add fails? CUDA Programming and Performance	2	2726	August 26, 2011
2matrix addition CUDA Programming and Performance	3	898	April 28, 2010

Why it doesnt work ? Simple program that adds two vectors

Related topics