small array (length 512) reduction inside loop

BlahCuda · February 23, 2011, 3:52pm

Greetings. Let’s say I have a code like this

for (int i = 0; i < 100000; i++)
{
reduction(on an array size 512);
do something useful with the summation;
randomly change some values of the array;
}

Two cases here. 1) where change of array values/location of change at each iteration is unpredictable and large such that I have to pretty much redo the reduction from scratch at each iteration and 2) change of array values/location of change at each iteration is unpredictable and small (5-10 elements) that perhaps I can use a different data structure (e.g. Fenwick tree) to do reduction.

In case 1), I am currently utilizing shared memory (1d array) and doing reduction akin to kernel found in SDK. Any other suggestion (e.g. 2d array)?

In case 2), is Fenwick tree the best data structure to utilize?

Finally, how much speedup would you expect this routine to have over a CPU one assuming both are optimized well?

EDIT: using Fermi Tesla C2050

Thanks in advance!

Topic		Replies	Views
Reduction kernel for Fermi CUDA Programming and Performance	8	1650	June 11, 2010
Reduction on small arrays CUDA Programming and Performance	1	594	February 1, 2011
How to perform multiple small reduction efficiently? CUDA Programming and Performance	3	926	May 24, 2013
Segmented Reduction of small subarrays CUDA Programming and Performance	17	1408	March 24, 2022
Would like to share my speedy reduction code Very simple code! CUDA Programming and Performance	0	1094	July 29, 2010
Any good ideas for this special "reduction" ? CUDA Programming and Performance	10	6814	November 20, 2009
Understanding and adjusting Mark Harris's array reduction CUDA Programming and Performance	11	4447	August 26, 2018
CUDA reduction CUDA Programming and Performance	10	51450	June 7, 2009
Sum reduction working in Fermi, Kepler and Maxwell CUDA Programming and Performance	10	3650	February 1, 2016
Efficient summing of a matrix CUDA Programming and Performance	1	3745	June 27, 2007

small array (length 512) reduction inside loop

Related topics