Device-Level Reordering Algorithms

Ziqi · September 4, 2024, 5:31pm

In general, this question is about sorting. What I am looking for, however, is not the sorted array itself but the reordering that sorts it. My definition of reordering goes as follows:

First, how do I generate this reordering permutation from a sequence? Second, is there a good parallel algorithm for generating it? I need the reordering rather than the sorted array because, in DoA, the reordering may be extracted from only one array, and all the other arrays then have to be sorted according to it.

Curefab · September 4, 2024, 5:58pm

Can you add details and a few numbers?

How large is the set S, and what is the largest integer in S? Depending on that, you can use register-local, shared-memory, or global lookup tables.

Sort pairs of numbers: for each element of that one array, create a pair (value, initial position), then sort those pairs by value.

After sorting you remove the values from the pairs and just keep the initial positions in their order after sorting.
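As a minimal sketch of the pair approach described above, here is a host-side C++ version. The function name `reordering_from_pairs` is made up for illustration, positions are 0-based (the notation in this thread is 1-based), and the values are assumed to be `int`. On the device, calling Thrust's `thrust::sort_by_key` with the values as keys and a `thrust::sequence` of positions as payload should give the same result without building explicit pairs.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical helper: returns r such that s[r[0]] <= s[r[1]] <= ...
// (0-based positions).
std::vector<std::size_t> reordering_from_pairs(const std::vector<int>& s) {
    // Step 1: attach the initial position to every value.
    std::vector<std::pair<int, std::size_t>> pairs;
    pairs.reserve(s.size());
    for (std::size_t i = 0; i < s.size(); ++i) {
        pairs.emplace_back(s[i], i);
    }

    // Step 2: sort the pairs. std::pair compares by value first and by
    // position second, so equal values keep their original relative order.
    std::sort(pairs.begin(), pairs.end());

    // Step 3: drop the values and keep only the initial positions.
    std::vector<std::size_t> r;
    r.reserve(pairs.size());
    for (const auto& p : pairs) {
        r.push_back(p.second);
    }
    assert(r.size() == s.size());
    return r;
}
```

For s = 10 50 40 30 20 this gives r = 0 4 3 2 1, i.e. the smallest value sits at position 0, the next at position 4, and so on.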

(Or instead of using pairs, create a lookup-table between value and initial position and resolve after normal sorting to get a function between initial and final position.)

1 < 2 => s(r(1)) < s(r(2)): r(k) is the initial position of the element that lands at final position k after sorting. s(r(k)) is the value of that element. s(1..n) is the initial sequence s; s(r(1..n)) is the reordered, sorted final sequence.

You can invert the function r (it is a bijection) to get the final position of a given element, or create a function that takes a value as its argument.
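Inverting r is a single scatter pass. A minimal host-side sketch, assuming r is stored 0-based as a `std::vector<std::size_t>`; the name `invert_reordering` is made up for illustration. Because r is a bijection, every output slot is written exactly once, so on a GPU each k could be handled by its own thread without write conflicts.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// r[k]   = initial position of the element that ends up at final position k.
// inv[i] = final position of the element that started at position i.
std::vector<std::size_t> invert_reordering(const std::vector<std::size_t>& r) {
    std::vector<std::size_t> inv(r.size());
    for (std::size_t k = 0; k < r.size(); ++k) {
        assert(r[k] < r.size());  // r must be a permutation of 0..n-1
        inv[r[k]] = k;            // scatter: each target slot is written once
    }
    return inv;
}
```

For r = 2 0 1 the inverse is 1 2 0: the element initially at position 0 ends up at final position 1.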

This should parallelize well.

Ziqi · September 4, 2024, 9:48pm

The size of S can be huge (millions or more).

Curefab · September 4, 2024, 9:59pm

How do you create this reordering from the one array?

Does the one array contain the sequence, and do all the other arrays have to be reordered the same way?

A = 1 5 4 3 2
B = 7 3 9 1 4 => 7 4 1 9 3

Or

A = 10 50 40 30 20
B = 7 3 9 1 4 => 7 4 1 9 3

Or (probably not)

A = 1 5 4 3 2
B = 7 3 9 1 4 => 7 4 1 9 3
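If the intended meaning is "sort A, then reorder B the same way", applying the reordering to B is a gather. A minimal host-side sketch, assuming the reordering is already available as 0-based positions; the name `apply_reordering` is made up for illustration. On the device, `thrust::gather` performs the same operation.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Gather: out[k] = in[r[k]], where r[k] is the initial position of the
// element that belongs at final position k.
template <typename T>
std::vector<T> apply_reordering(const std::vector<T>& in,
                                const std::vector<std::size_t>& r) {
    assert(in.size() == r.size());
    std::vector<T> out(in.size());
    for (std::size_t k = 0; k < r.size(); ++k) {
        assert(r[k] < in.size());  // every index must be in range
        out[k] = in[r[k]];
    }
    return out;
}
```

Both A = 1 5 4 3 2 and A = 10 50 40 30 20 yield the 0-based reordering 0 4 3 2 1, and applying it to B = 7 3 9 1 4 gives 7 4 1 9 3, as in the examples above. The same reordering can then be reused for every other array.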

Does the first array contain every integer (contiguously) up to the millions, or are there gaps?

This answer probably solves your problem (with initial keys 0…n-1); afterwards, the sorted keys are the indices into the other arrays, in order:
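The linked answer is not reproduced here. As a rough host-side sketch of the same idea, and not necessarily what that answer does, one can start from the indices 0…n-1 and sort them by the values they point to. The name `reordering_from_indices` is made up for illustration, and the values are assumed to be `int`.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical helper: sorts the indices 0..n-1 by the values they refer to.
std::vector<std::size_t> reordering_from_indices(const std::vector<int>& a) {
    // Initial keys 0, 1, ..., n-1.
    std::vector<std::size_t> idx(a.size());
    std::iota(idx.begin(), idx.end(), std::size_t{0});

    // Compare indices through the values in a. stable_sort keeps equal
    // values in their original relative order.
    auto by_value = [&a](std::size_t i, std::size_t j) { return a[i] < a[j]; };
    std::stable_sort(idx.begin(), idx.end(), by_value);

    // Debug check: the indices now visit a in non-decreasing order.
    assert(std::is_sorted(idx.begin(), idx.end(), by_value));
    return idx;
}
```

The returned indices can be used directly to index into all the other arrays, so the values of the first array never have to be moved.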
