Correct, -use_fast_math implies -ftz=true -prec-div=false -prec-sqrt=false. I will followup with the documentation team to check where this is documented, or if it’s not documented yet, get it properly documented for the next CUDA release.
The only documentation I could find is via nvcc --help (see below). I will file a bug to get this included in the appropriate document for the next version of CUDA. Thanks for alerting us to this documentation issue.