Optimized version of single-precision error function, erff()

Thanks for pointing that out. Comment had been deleted inadvertently during edit; restored now.

The accuracy of the core approximation used for my_erfcf() in #18 has been improved (no change to maximum observed ulp error, though).