Hello, I have a problem. I am working on a radix sort algorithm and want to sort one million numbers. I chose a block size of 256 threads. At this stage I have the array divided into 3,907 blocks, and the numbers within each block are sorted in ascending order.
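To make the current state concrete, here is a minimal CPU-side sketch in Python, not the actual GPU code. It assumes one key per thread (so 256 keys per block), and `sorted()` is only a stand-in for the per-block radix sort. The names `N`, `BLOCK_SIZE`, and `keys` are illustrative.

```python
import random

N = 1_000_000      # number of keys, as stated in the post
BLOCK_SIZE = 256   # assumed: one key per thread, 256 threads per block

# Ceiling division: the last block is only partly filled.
num_blocks = (N + BLOCK_SIZE - 1) // BLOCK_SIZE

random.seed(0)
keys = [random.randrange(2**32) for _ in range(N)]

# Sort every block independently; sorted() stands in for the
# per-block radix sort that would run on the GPU.
for b in range(num_blocks):
    start = b * BLOCK_SIZE
    end = min(start + BLOCK_SIZE, N)
    keys[start:end] = sorted(keys[start:end])

# Size of the final, partly filled block.
last_block_len = N - (num_blocks - 1) * BLOCK_SIZE
print(num_blocks, last_block_len)  # 3907 64
```

After this step each block is sorted internally, but the array as a whole is not, which is exactly the point where I am stuck.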
Now I do not know how to combine these blocks efficiently and in parallel so that the whole array ends up sorted.
I have studied a variety of literature, but I still do not understand how the algorithm does this.
Thank you in advance for any advice and experience with this algorithm.