binary division in cuda


I have two array of bits(one of size 32208 and the other 192,these values come from the bch encoder according to dvbs2x standard), I want to divide the first array of bits with the second one.
Has anyone tried out binary division in cuda(gpu) or can any one point to any library that does this operation.
Thanks in advance.