Parallel selection problem

abdelali_zahi · June 18, 2007, 12:09pm

Dear all,

I have a array of 100,000 float elements. I want to be able to construct a new array (containing either the elements or just the indices) that fulfill a certain condition (let us say the positive ones).

Is anyone aware of possible approaches to solve this problem on CUDA knowing that the number of size of the results array is not known at the beginning at the procedure and that it has to be fed in parallel with new elements.

I am pretty sure that this problem is common and that it has been studied before (database selection would be an example).

I would appreciate any help.

Thank you in advance.

YetAnotherNoob · June 18, 2007, 12:33pm

Maybe someone else will have a solution that I’m not seeing, but that doesn’t look like the type of thing a G80 would be good for, unless the test that is performed at each element is really complicated (at which point you could generate a mask array and send that back).

I guess you could break it up into a lot of little chunks, then construct pieces of the output in shared memory + dump those to their own arrays (which then get concatenated during the copy back to system memory). This may or may not be faster than just doing it on the CPU, though…

abdelali_zahi · June 18, 2007, 1:15pm

Thank you YetAnotherNoob for your reply. I agree with you that this multiselection problem is not inherently parallel. The reason why I was thinking of doing this on the GPU is that I will need to do this operation ~200 times as an intermediary step to other calculations (least squares regressions) on the graphics card. Copying the data back to system memory would be too expensive.

regards.

MisterAnderson42 · June 18, 2007, 2:15pm

Of course, it doesn’t immediately seem parallel, but there is a parallel way to do it. It’s called “stream compaction”. See this thread: [url=“The Official NVIDIA Forums | NVIDIA”]The Official NVIDIA Forums | NVIDIA

Topic		Replies	Views
parallel find find multiple items from a array CUDA Programming and Performance	4	4405	February 23, 2009
parallel search CUDA Programming and Performance	2	1190	June 26, 2009
how to parallel this simple algorithm? someone has met this before? CUDA Programming and Performance	2	7287	December 3, 2010
Optimizing workload with Large amounts of computations but small amount of results CUDA Programming and Performance	1	494	March 25, 2018
How to put specific elements from one array to another array use CUDA? CUDA Programming and Performance cuda	6	1498	October 30, 2022
Funny development problem CUDA Programming and Performance	2	1160	March 13, 2013
How to parallel a seirial code CUDA Programming and Performance	4	751	March 16, 2018
Re-arrange one dimension array CUDA Programming and Performance	2	467	October 21, 2011
Can this be parallelized? CUDA Programming and Performance	24	28739	November 7, 2007
Removing elements from a global array written across blocks CUDA Programming and Performance	5	1530	June 15, 2009

Parallel selection problem

Related topics