I have arrays in global memory, and I want to sort them in parallel.
Say I have 12 float arrays, each has 1k data. I can use CUDPP to sort them one by one. But is it possible to sort them simultaneously?
Especially when the size of each array is not so big, sorting sequentially would be a waste of threads.