Array addition Addition of all th elements of an array

rmd22 · August 27, 2009, 5:48am

I want to add all the elements of an array to understand how the threads work. But I think I am making some mistake in my code, which I am not able to understand.

I am trying to generate just one thread and execute it over a loop in GPU to ad all the elements of array A.

Is it possible?

if not, then why?

I would be glad if someone could take a look at my program and try to help me out with it. Thanks in advance. Here’s my program.

[codebox]#include<stdio.h>

#include<stdlib.h>

void global add(double *A, double *C, int N){

    int i;

    i=threadIdx.x;

    double add=0;

    for(i=0;i<N;i++){

    add=A[i]+add;

    }

    C[0]=add;

}

main(){

int N,i;

N=10;

double A[N],C[0];

for(i=0;i<N;i++){

A[i]=1.0;

}

double *d_A,*d_C;

size_t size=N*sizeof(double);

cudaMalloc((void**)&d_A,size);

//cudaMalloc((void**)&d_B,size);

size_t sizeC=1*sizeof(double);

cudaMalloc((void**)&d_C,sizeC);

cudaMemcpy(d_A, A, size,cudaMemcpyHostToDevice);

add<<<1,1>>>(d_A,d_C,N);

cudaMemcpy(C,d_C,sizeC,cudaMemcpyDeviceToHost);

printf("\n%f ",C[0]);

}

[/codebox]

shifter1 · August 27, 2009, 11:59am

How are you compiling your program? If you have a 9000 or lower series card, you cannot use doubles, use floats instead. If you have a 200 series card, you need to compile with a special flag that allows you to use doubles. “-arch_sm13” or something like that.

LSChien · August 27, 2009, 2:10pm

I want to add all the elements of an array to understand how the threads work. But I think I am making some mistake in my code, which I am not able to understand.

I am trying to generate just one thread and execute it over a loop in GPU to ad all the elements of array A.

Is it possible?

if not, then why?

I would be glad if someone could take a look at my program and try to help me out with it. Thanks in advance. Here’s my program.

[codebox]#include<stdio.h>

main(){

int N,i;

N=10;

double A[N],C[0];
     ^^^^^^^^^
[/codebox]

I wonder how could you compile your code

you will have two compilation errors

A[N]

N is not a constant expression
C[0]

you can not define a vector of size 0

modify your code as

[codebox]define N 10

int main()

{

int i;



double A[N] ;

double C[1] ;

…[/codebox]

then program works

my platform: winxp pro64, vc2005, driver 190.38, cuda 2.3, GTX295

kalman · August 27, 2009, 8:53pm

With a gnu compiler for example.

rmd22 · August 28, 2009, 5:16am

Thanks for the reply guys. Yes I found the problem with C[0] just after posting this code. I was able to get my code working for single precision but double precision is giving me wrong answer. I am guessing the problem is with the library path or something since I am using 64 bit AMD with fedora 10 (64 bit). If you know how to rectify this problem please post your comments and help me out. Thanks.

LSChien · August 28, 2009, 2:16pm

I test two machines

winxp pro 64, vc2005, driver 190.38, cuda 2.3, GTX295
Fedora 10 x64, gcc 4.3.2, driver 185.18, cuda2.2, GTX260

the program works for both machines, “float” and “double”

what is your configuration?

Topic		Replies	Views
Need some help with my code to add tow arrays and print them in couda (c[i] = a[i] + b[i]) CUDA Programming and Performance	1	3930	October 30, 2010
How to sum all the elements of an array CUDA Programming and Performance	4	30625	April 6, 2011
double array or float array CUDA Programming and Performance	2	866	January 10, 2014
Sum in double loop calling atomicAdd() CUDA Programming and Performance	1	903	March 18, 2012
2Arrays addition program give wrong results for large size arrays CUDA Programming and Performance	4	618	August 28, 2016
Double loop, sum on j for each i. atomicAdd()? CUDA Programming and Performance	20	7199	March 25, 2012
Why does atomicAdd not work with doubles as input? CUDA Programming and Performance	6	14181	December 21, 2017
Https://developer.nvidia.com/cuda-education Teaching & Curriculum Support	0	817	August 22, 2021
Slow exe and Error when adding array elements and asigning result to another array element CUDA Programming and Performance	0	433	December 17, 2012
The kernel isn't working CUDA Programming and Performance	9	1138	January 19, 2011

Array addition Addition of all th elements of an array

Related topics