Hello to all!
I am looking for parallel CUDA code that does the following:
Given the array a = [1,0,0,0,0,3,0,0,5,2,2,3,1,0,0,0,0,0,2],
the output should be the array
a = [1,3,5,2,2,3,1,2] (the same array with the zeros removed and the order of the remaining elements kept).
I think that this algorithm is serial, but I need a parallel algorithm to do this. Is there any ready-made CUDA code for something like this?
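To make the operation concrete, here is a minimal serial sketch of what I mean, written as plain host-side C++. The function name remove_zeros_serial is only illustrative, and this is the serial reference behaviour, not the parallel version I am asking for.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper name, for illustration only.
// Serial reference: copies the nonzero elements of `in` into a new
// vector, keeping them in their original order.
std::vector<int> remove_zeros_serial(const std::vector<int>& in) {
    std::vector<int> out;
    out.reserve(in.size());  // at most in.size() elements survive
    for (std::size_t i = 0; i < in.size(); ++i) {
        if (in[i] != 0) {
            // The output position of this element depends on how many
            // nonzero elements came before it, which is why the loop
            // looks serial to me.
            out.push_back(in[i]);
        }
    }
    return out;
}
```

Calling it on the array above should give the 8-element array from my example.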
Thank you very much in advance!