I have some code that finds certain items in an array and saves them into another array. Is it possible to parallelize it with CUDA?
[codebox]
std::vector<float> a(N);
std::vector<float> b;
for (int i = 0; i < N; i++)
{
    if (a[i] >= t)
        b.push_back(a[i]);
}
[/codebox]
Thanks
Sure, why not. It is a parallelizable problem. The difficulty comes from synchronization among blocks; you need to find a way to handle that (it is certainly possible).
That said, it will only pay off if your array is quite big. Otherwise I don't think it would be worth doing in CUDA.
Thanks for the reply.
The reason I want to do this is that the array is already generated on the GPU, and it would take a lot of time to copy it all back to the CPU. So I am trying to copy back only the elements I need (1% or less).
What you described is known as “stream compaction”. It’s a common and useful technique.
It’s implemented in CUDPP (http://www.gpgpu.org/developer/cudpp/), but it’s worth understanding how it works in general.
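For intuition, here is a sequential C++ sketch of the scan-based compaction idea (the general approach libraries like CUDPP parallelize); the function name and types are illustrative, not CUDPP’s actual API. Each of the three steps (flag, scan, scatter) is itself data-parallel, which is what makes the technique GPU-friendly:

```cpp
#include <vector>

// Scan-based stream compaction, shown sequentially for clarity.
// Step 1: mark each element with a 0/1 flag.
// Step 2: an exclusive prefix sum of the flags gives each surviving
//         element its destination index (and the total count).
// Step 3: scatter the surviving elements to their destinations.
std::vector<int> compact(const std::vector<int>& a, int t)
{
    std::vector<int> flags(a.size()), indices(a.size());
    for (std::size_t i = 0; i < a.size(); ++i)
        flags[i] = (a[i] >= t) ? 1 : 0;

    int running = 0;                       // exclusive scan of the flags
    for (std::size_t i = 0; i < a.size(); ++i) {
        indices[i] = running;
        running += flags[i];
    }

    std::vector<int> b(running);           // total output count = final scan value
    for (std::size_t i = 0; i < a.size(); ++i)
        if (flags[i])
            b[indices[i]] = a[i];          // stable: preserves input order
    return b;
}
```

Note that this variant is stable (output keeps the input order), which the atomic approach below does not guarantee.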
Also, if you’re outputting just a small fraction of the results and order doesn’t matter, you could consider using atomic increments to write each passing element into a device array. That’s slow in general if you have many output values, but very easy to implement.
Thanks. I knew there must be a general algorithm for this, but I didn’t know the keyword.