I know rounding errors happen in floating point arithmetic but can somebody explain the reason for this one: <pre class="prettyprint"><code>>>> 8.0 / 0.4 # as expected 20.0 >>> floor(8.0 / 0.4) # int works too 20 >>> 8.0 // 0.4 # expecting 20.0 19.0 </code></pre> This happens on both Python 2 and 3 on x64. As far as I see it this is either a bug or a very dumb specification of <code>//</code> since I don't see any reason why the last expression should evaluate to <code>19.0</code>. Why isn't <code>a // b</code> simply defined as <code>floor(a / b)</code> ? EDIT: <code>8.0 % 0.4</code> also evaluates to <code>0.3999999999999996</code>. At least this is consequent since then <code>8.0 // 0.4 * 0.4 + 8.0 % 0.4</code> evaluates to <code>8.0</code> EDIT: This is not a duplicate of Is floating point math broken? since I am asking why this specific operation is subject to (maybe avoidable) rounding errors, and why <code>a // b</code> isn't defined as / equal to <code>floor(a / b)</code> REMARK: I guess that the deeper reason why this doesn't work is that floor division is discontinuous and thus has an infinite condition number making it an ill-posed problem. Floor division and floating-point numbers simply are fundamentally incompatible and you should never use <code>//</code> on floats. Just use integers or fractions instead.

As you and khelwood already noticed, <code>0.4</code> cannot be exactly represented as a float. Why? It is two fifth (<code>4/10 == 2/5</code>) which does not have a finite binary fraction representation. Try this: <pre class="prettyprint"><code>from fractions import Fraction Fraction('8.0') // Fraction('0.4') # or equivalently # Fraction(8, 1) // Fraction(2, 5) # or # Fraction('8/1') // Fraction('2/5') # 20 </code></pre> However <pre class="prettyprint"><code>Fraction('8') // Fraction(0.4) # 19 </code></pre> Here, <code>0.4</code> is interpreted as a float literal (and thus a floating point binary number) which requires (binary) rounding, and only then converted to the rational number <code>Fraction(3602879701896397, 9007199254740992)</code>, which is almost but not exactly 4 / 10. Then the floored division is executed, and because <pre class="prettyprint"><code>19 * Fraction(3602879701896397, 9007199254740992) < 8.0 </code></pre> and <pre class="prettyprint"><code>20 * Fraction(3602879701896397, 9007199254740992) > 8.0 </code></pre> the result is 19, not 20. The same probably happens for <pre class="prettyprint"><code>8.0 // 0.4 </code></pre> I.e., it seems floored division is determined atomically (but on the only approximate float values of the interpreted float literals). So why does <pre class="prettyprint"><code>floor(8.0 / 0.4) </code></pre> give the "right" result? Because there, two rounding errors cancel each other out. First 1) the division is performed, yielding something slightly smaller than 20.0, but not representable as float. It gets rounded to the closest float, which happens to be <code>20.0</code>. Only then, the <code>floor</code> operation is performed, but now acting on exactly <code>20.0</code>, thus not changing the number any more. <hr> 1) As Kyle Strand points out, that the exact result is determined then rounded isn't what actually happens low2)-level (CPython's C code or even CPU instructions). However, it can be a useful model for determining the expected 3) result. 2) On the lowest 4) level, however, this might not be too far off. Some chipsets determine float results by first computing a more precise (but still not exact, simply has some more binary digits) internal floating point result and then rounding to IEEE double precision. 3) "expected" by the Python specification, not necessarily by our intuition. 4) Well, lowest level above logic gates. We don't have to consider the quantum mechanics that make semiconductors possible to understand this.

After checking the semi-official sources of the float object in cpython on github (https://github.com/python/cpython/blob/966b24071af1b320a1c7646d33474eeae057c20f/Objects/floatobject.c) one can understand what happens here. For normal division <code>float_div</code> is called (line 560) which internally converts the python <code>float</code>s to c-<code>double</code>s, does the division and then converts the resulting <code>double</code> back to a python <code>float</code>. If you simply do that with <code>8.0/0.4</code> in c you get: <pre class="prettyprint"><code>#include "stdio.h" #include "math.h" int main(){ double vx = 8.0; double wx = 0.4; printf("%lf\n", floor(vx/wx)); printf("%d\n", (int)(floor(vx/wx))); } // gives: // 20.000000 // 20 </code></pre> For the floor division, something else happens. Internally, <code>float_floor_div</code> (line 654) gets called, which then calls <code>float_divmod</code>, a function that is supposed to return a tuple of python <code>float</code>s containing the floored division, as well as the mod/remainder, even though the latter is just thrown away by <code>PyTuple_GET_ITEM(t, 0)</code>. These values are computed the following way (After conversion to c-<code>double</code>s): <ol> <li>The remainder is computed by using <code>double mod = fmod(numerator, denominator)</code>.</li> <li>The numerator is reduced by <code>mod</code> to get a integral value when you then do the division.</li> <li>The result for the floored division is calculated by effectively computing <code>floor((numerator - mod) / denominator)</code> </li> <li>Afterwards, the check already mentioned in @Kasramvd's answer is done. But this only snaps the result of <code>(numerator - mod) / denominator</code> to the nearest integral value.</li> </ol> The reason why this gives a different result is, that <code>fmod(8.0, 0.4)</code> due to floating-point arithmetic gives <code>0.4</code> instead of <code>0.0</code>. Therefore, the result that is computed is actually <code>floor((8.0 - 0.4) / 0.4) = 19</code> and snapping <code>(8.0 - 0.4) / 0.4) = 19</code> to the nearest integral value does not fix the error made introduced by the "wrong" result of <code>fmod</code>. You can easily chack that in c as well: <pre class="prettyprint"><code>#include "stdio.h" #include "math.h" int main(){ double vx = 8.0; double wx = 0.4; double mod = fmod(vx, wx); printf("%lf\n", mod); double div = (vx-mod)/wx; printf("%lf\n", div); } // gives: // 0.4 // 19.000000 </code></pre> I would guess, that they chose this way of computing the floored division to keep the validity of <code>(numerator//divisor)*divisor + fmod(numerator, divisor) = numerator</code> (as mentioned in the link in @0x539's answer), even though this now results in a somewhat unexpected behavior of <code>floor(8.0/0.4) != 8.0//0.4</code>.

rounding errors in Python floor division

Tags:

python

floating-point

rounding

python-3.x

python-2.7

I know rounding errors happen in floating point arithmetic but can somebody explain the reason for this one:

>>> 8.0 / 0.4  # as expected 20.0 >>> floor(8.0 / 0.4)  # int works too 20 >>> 8.0 // 0.4  # expecting 20.0 19.0

This happens on both Python 2 and 3 on x64.

As far as I see it this is either a bug or a very dumb specification of // since I don't see any reason why the last expression should evaluate to 19.0.

Why isn't a // b simply defined as floor(a / b) ?

EDIT: 8.0 % 0.4 also evaluates to 0.3999999999999996. At least this is consequent since then 8.0 // 0.4 * 0.4 + 8.0 % 0.4 evaluates to 8.0

EDIT: This is not a duplicate of Is floating point math broken? since I am asking why this specific operation is subject to (maybe avoidable) rounding errors, and why a // b isn't defined as / equal to floor(a / b)

REMARK: I guess that the deeper reason why this doesn't work is that floor division is discontinuous and thus has an infinite condition number making it an ill-posed problem. Floor division and floating-point numbers simply are fundamentally incompatible and you should never use // on floats. Just use integers or fractions instead.

442

asked Jul 26 '16 11:07

0x539

2 Answers

As you and khelwood already noticed, 0.4 cannot be exactly represented as a float. Why? It is two fifth (4/10 == 2/5) which does not have a finite binary fraction representation.

Try this:

from fractions import Fraction Fraction('8.0') // Fraction('0.4')     # or equivalently     #     Fraction(8, 1) // Fraction(2, 5)     # or     #     Fraction('8/1') // Fraction('2/5') # 20

However

Fraction('8') // Fraction(0.4) # 19

Here, 0.4 is interpreted as a float literal (and thus a floating point binary number) which requires (binary) rounding, and only then converted to the rational number Fraction(3602879701896397, 9007199254740992), which is almost but not exactly 4 / 10. Then the floored division is executed, and because

19 * Fraction(3602879701896397, 9007199254740992) < 8.0

and

20 * Fraction(3602879701896397, 9007199254740992) > 8.0

the result is 19, not 20.

The same probably happens for

8.0 // 0.4

I.e., it seems floored division is determined atomically (but on the only approximate float values of the interpreted float literals).

So why does

floor(8.0 / 0.4)

give the "right" result? Because there, two rounding errors cancel each other out. First¹⁾ the division is performed, yielding something slightly smaller than 20.0, but not representable as float. It gets rounded to the closest float, which happens to be 20.0. Only then, the floor operation is performed, but now acting on exactly 20.0, thus not changing the number any more.

¹⁾ As Kyle Strand points out, that the exact result is determined then rounded isn't what actually happens low²⁾-level (CPython's C code or even CPU instructions). However, it can be a useful model for determining the expected³⁾ result.

²⁾ On the lowest⁴⁾ level, however, this might not be too far off. Some chipsets determine float results by first computing a more precise (but still not exact, simply has some more binary digits) internal floating point result and then rounding to IEEE double precision.

³⁾ "expected" by the Python specification, not necessarily by our intuition.

⁴⁾ Well, lowest level above logic gates. We don't have to consider the quantum mechanics that make semiconductors possible to understand this.

answered Oct 13 '22 23:10

das-g

After checking the semi-official sources of the float object in cpython on github (https://github.com/python/cpython/blob/966b24071af1b320a1c7646d33474eeae057c20f/Objects/floatobject.c) one can understand what happens here.

For normal division float_div is called (line 560) which internally converts the python floats to c-doubles, does the division and then converts the resulting double back to a python float. If you simply do that with 8.0/0.4 in c you get:

#include "stdio.h" #include "math.h"  int main(){     double vx = 8.0;     double wx = 0.4;     printf("%lf\n", floor(vx/wx));     printf("%d\n", (int)(floor(vx/wx))); }  // gives: // 20.000000 // 20

For the floor division, something else happens. Internally, float_floor_div (line 654) gets called, which then calls float_divmod, a function that is supposed to return a tuple of python floats containing the floored division, as well as the mod/remainder, even though the latter is just thrown away by PyTuple_GET_ITEM(t, 0). These values are computed the following way (After conversion to c-doubles):

The remainder is computed by using double mod = fmod(numerator, denominator).
The numerator is reduced by mod to get a integral value when you then do the division.
The result for the floored division is calculated by effectively computing floor((numerator - mod) / denominator)
Afterwards, the check already mentioned in @Kasramvd's answer is done. But this only snaps the result of (numerator - mod) / denominator to the nearest integral value.

The reason why this gives a different result is, that fmod(8.0, 0.4) due to floating-point arithmetic gives 0.4 instead of 0.0. Therefore, the result that is computed is actually floor((8.0 - 0.4) / 0.4) = 19 and snapping (8.0 - 0.4) / 0.4) = 19 to the nearest integral value does not fix the error made introduced by the "wrong" result of fmod. You can easily chack that in c as well:

#include "stdio.h" #include "math.h"  int main(){     double vx = 8.0;     double wx = 0.4;     double mod = fmod(vx, wx);     printf("%lf\n", mod);     double div = (vx-mod)/wx;     printf("%lf\n", div); }  // gives: // 0.4 // 19.000000

I would guess, that they chose this way of computing the floored division to keep the validity of (numerator//divisor)*divisor + fmod(numerator, divisor) = numerator (as mentioned in the link in @0x539's answer), even though this now results in a somewhat unexpected behavior of floor(8.0/0.4) != 8.0//0.4.

answered Oct 14 '22 00:10

jotasi

Related questions
                            
                                Pytorch softmax: What dimension to use?
                            
                                What is the difference between Python vs Jython vs IronPython vs wxPython?
                            
                                How to read from a zip file within zip file in Python? [duplicate]
                            
                                How to install MySQLdb package? (ImportError: No module named setuptools)
                            
                                Django - use reverse url mapping in settings
                            
                                Django Multiple Choice Field / Checkbox Select Multiple
                            
                                File mode for creating+reading+appending+binary
                            
                                Check for presence of a sliced list in Python
                            
                                Setting different reply-to message in Python email/smtplib
                            
                                Find the index of the n'th item in a list
                            
                                Changing the background color of the axes planes of a matplotlib 3D plot
                            
                                What is difference between os.getuid() and os.geteuid()?
                            
                                How to get index of element in Set object
                            
                                How to explain the str.maketrans function in Python 3.6?
                            
                                Dataframe set_index not setting
                            
                                Changing the directory where .pyc files are created
                            
                                How do I list all the attributes of an object in python pdb?
                            
                                In Flask, set a cookie and then re-direct user
                            
                                pandas dataframe create new columns and fill with calculated values from same df
                            
                                How do I import from a file in the current directory in Python 3?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With