newb question parallel add array in cuda

I was wondering if there was a way of summing an array like you would with say pthreads in c instead of using reduction if so could some explain how if not then another simple parallel program that I could use for a demonstration in a presentation.


Check out the samples in the SDK. One of them shows how to do reductions, the others you can also use for demonstration.