For float, multiplication and division, which is faster in CUDA?

i want to scaled vector, multiplication and division, which is faster in CUDA?

multiplication is usually faster than division.