Small benchmark - Division 10 divisions were repeated 10 ** 6 times. The amount of time taken by 10 ** 7 operations was shown. Scaling with multiplication showed a bit better performance. Could we get the guidline on implementation ? The fscale method is better for multiplications, and the multiplication method is better for divisions ?