Array Sum in CUDA

riclas · July 4, 2008, 1:22pm

Hi, I'm trying to write a function in CUDA to sum all the values of an array. I have this:

__global__ void cuArraySumF_D(float *src, float *sum, int len){
	__shared__ float s;

	s = 0;
	__syncthreads();

	for(int i = threadIdx.x; i < len; i += N_THREADS)
		s += src[i];
	__syncthreads();

	if(threadIdx.x == 0)
		*sum = s;
}

I'm launching N_THREADS threads in a single block so that shared memory works.

But it fails to run. I guess it's because every thread is writing to the same shared variable.

So, is there any way to optimize an array sum in CUDA using threads?

Thanks

Sibi_A · July 4, 2008, 1:58pm

It looks like you are going about this the wrong way.
See the reduction sample in the SDK (and the accompanying reduction document), which does exactly what you want.

Maybe you can reuse them…
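For readers without the SDK at hand, here is a minimal sketch of the shared-memory idea that the reduction sample is built around, kept to the single-block layout from the first post. It is not the SDK code: the names `arraySumOneBlock` and `sumOnDevice` and the value `N_THREADS = 256` are assumptions made for illustration, and error checking is left out. Each thread first adds up its own strided slice in a private variable, so no two threads ever write the same location, and the per-thread partial sums are then combined pairwise in shared memory.

```cuda
#include <cuda_runtime.h>

// Assumed block size (hypothetical value); the halving loop below
// requires it to be a power of two.
#define N_THREADS 256

// Sums src[0..len) using a single block of N_THREADS threads.
__global__ void arraySumOneBlock(const float *src, float *sum, int len)
{
    __shared__ float partial[N_THREADS];

    // Step 1: each thread accumulates its own strided slice privately.
    float acc = 0.0f;
    for (int i = threadIdx.x; i < len; i += N_THREADS)
        acc += src[i];
    partial[threadIdx.x] = acc;
    __syncthreads();

    // Step 2: tree reduction. Each pass halves the number of active
    // threads; every active thread adds one partner element.
    for (int stride = N_THREADS / 2; stride > 0; stride /= 2) {
        if (threadIdx.x < stride)
            partial[threadIdx.x] += partial[threadIdx.x + stride];
        __syncthreads();  // outside the if, so every thread reaches it
    }

    // Step 3: thread 0 holds the total.
    if (threadIdx.x == 0)
        *sum = partial[0];
}

// Hypothetical host helper: copies the data to the device, launches
// the kernel, and returns the sum. Error checking omitted for brevity.
float sumOnDevice(const float *hostData, int len)
{
    float *dSrc = nullptr;
    float *dSum = nullptr;
    float result = 0.0f;

    cudaMalloc((void **)&dSrc, len * sizeof(float));
    cudaMalloc((void **)&dSum, sizeof(float));
    cudaMemcpy(dSrc, hostData, len * sizeof(float), cudaMemcpyHostToDevice);

    arraySumOneBlock<<<1, N_THREADS>>>(dSrc, dSum, len);

    // cudaMemcpy waits for the kernel to finish before copying.
    cudaMemcpy(&result, dSum, sizeof(float), cudaMemcpyDeviceToHost);

    cudaFree(dSrc);
    cudaFree(dSum);
    return result;
}
```

Calling `sumOnDevice` on an array of 1000 elements that are all 1.0f returns 1000. A single block uses only a small part of the GPU, which is why larger inputs are usually reduced with many blocks and more than one pass.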

riclas · July 4, 2008, 2:17pm

Thanks, that's really what I needed to see :)

jordyvaneijk · July 8, 2008, 2:56pm

You can also take a look at CUDPP.

s.c.o.r.p.i.o.n · May 29, 2010, 9:28pm

Can you paste an example of summing an array in CUDA?

s.c.o.r.p.i.o.n · May 30, 2010, 3:13pm

I found an example of array reduction, but I do not understand i^1:

for ( i = 0; n >= BLOCK_SIZE; n /= (2*BLOCK_SIZE), i++ ){
	dim3 dimBlock (BLOCK_SIZE, 1, 1);
	dim3 dimGrid (n / (2*dimBlock.x), 1, 1);

	reduce4 <<< dimGrid, dimBlock >>> (adev[i], adev[i^1]);
}

I understand what i^1 evaluates to:

i | i^1
0 | 1
1 | 0
2 | 3
3 | 2
4 | 5
5 | 4

But why? Can't it be simpler?
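A likely reading of `i^1`, assuming `adev` holds two device buffers: XOR with 1 flips the lowest bit, so the index alternates between 0 and 1 and each pass reads from the buffer the previous pass wrote to (often called ping-pong buffering). The table above also shows that once `i` reaches 2 the indices become 2 and 3, so the quoted loop only fits two buffers if it runs at most two passes; whether `adev` has more entries cannot be told from the excerpt. The sketch below keeps a separate index and toggles it with `cur ^= 1`, which stays within 0 and 1 for any number of passes. The names `halvePass` and `pingPongSum` and the value `BLOCK_SIZE = 256` are hypothetical, the kernel is a deliberately simple pairwise pass and not `reduce4`, and error checking is left out.

```cuda
#include <cuda_runtime.h>

#define BLOCK_SIZE 256  // assumed threads per block (hypothetical value)

// One pass: folds the upper half of in[0..m) onto the lower half, so
// out[0..(m+1)/2) has the same total as in[0..m).
__global__ void halvePass(const float *in, float *out, int m)
{
    int j = blockIdx.x * blockDim.x + threadIdx.x;
    int outCount = (m + 1) / 2;
    if (j < outCount) {
        float v = in[j];
        if (j + outCount < m)      // the middle element of an odd m has no partner
            v += in[j + outCount];
        out[j] = v;
    }
}

// Hypothetical host helper: sums n >= 1 floats by repeated halving,
// alternating between two device buffers.
float pingPongSum(const float *hostData, int n)
{
    float *buf[2];
    float result = 0.0f;

    cudaMalloc((void **)&buf[0], n * sizeof(float));
    cudaMalloc((void **)&buf[1], n * sizeof(float));
    cudaMemcpy(buf[0], hostData, n * sizeof(float), cudaMemcpyHostToDevice);

    int cur = 0;  // index of the buffer that holds the current input
    for (int m = n; m > 1; m = (m + 1) / 2) {
        int outCount = (m + 1) / 2;
        int blocks = (outCount + BLOCK_SIZE - 1) / BLOCK_SIZE;

        // Read from buf[cur], write to the other buffer.
        halvePass<<<blocks, BLOCK_SIZE>>>(buf[cur], buf[cur ^ 1], m);

        cur ^= 1;  // the output of this pass is the input of the next
    }

    // After the last pass the total sits in element 0 of buf[cur].
    cudaMemcpy(&result, buf[cur], sizeof(float), cudaMemcpyDeviceToHost);

    cudaFree(buf[0]);
    cudaFree(buf[1]);
    return result;
}
```

For the ten values 1 through 10, `pingPongSum` runs four passes (10, 5, 3, 2 elements) and returns 55. The same toggle can be written as `cur = 1 - cur`; XOR is just a compact way to swap the roles of the two buffers without copying data back between passes.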
