I got a quite specific problem, so here is what I want to do:
I have a BitIndex and an input array consisting of characters. The bit index marks the positions of the chars in the array that need to be copyed to a new array. It looks like this:
0 0 0 1 1 0 1 0 … BitIndex
c0 c1 c2 c3 c4 c5 c6 c7 … Char Array
c3 c4 c6 … Result
Any idea how i could realize this in CUDA? Especially considering that the Char Array could be very large and that it might take several blocks to process it.
Thanks for your helpPicknick3r