quads in cuda?

If it is possible that you can compute the terms of the infinite sums independently and use parallel reductions to sum them up.
Parallel reductions help immensely with round off error because at each step floating point numbers of similar magnitude are added.