The following python code calculates the number of iterations to do stuff based on some variables. <pre class="prettyprint"><code> # a - b - c is always a multiple of d. i = (a - b - c) / d while i: # do stuff i -= 1 </code></pre> The variables will all be of the same type, that is only <code>ints</code> or <code>floats</code> or whatever. My concern is whether it will work correctly if the values are <code>floats</code>. I know enough to always consider the pitfalls of relying on exact float values. But I can't tell if the above is dangerous or not. I can use <code>i = int(round((a - b - c) / d))</code>, but I am curious as to understand floats better. It all comes down to the following: <code>a - b - c</code> is an exact multiple of <code>d</code>. So I am relying on <code>(a-b-c)/d</code> to become a value <code>i</code> that I can subtract <code>1</code> from and get the expected number of iterations in the while loop, with the implied assumption that <code>i == 0</code> becomes true. That is, can calculated multiples like this be decremented by 1 to reach exactly 0? I would like to not only know if it is unsafe, but more importantly, what do I need to understand about floating point to resolve a question like this? If someone knows decisively whether this is safe or not, would it be possible to explain how so?

You can use the decimal module to get an idea of what "hides" between a floating point number such as <code>0.3</code>: <pre class="prettyprint"><code>>>> from decimal import Decimal >>> Decimal(0.3) Decimal('0.299999999999999988897769753748434595763683319091796875') </code></pre> Note that Python 2.7 changed how floating point numbers are written (how <code>repr(f)</code> works) so that it now shows the shortest string that will give the same floating point number if you do <code>float(s)</code>. This means that <code>repr(0.3) == '0.3'</code> in Python 2.7, but <code>repr(0.3) == '0.29999999999999999'</code> in earlier versions. I'm mentioning this since it can confuse things further when you really want to see what's behind the numbers. Using the decimal module, we can see the error in a computation with floats: <pre class="prettyprint"><code>>>> (Decimal(2.0) - Decimal(1.1)) / Decimal(0.3) - Decimal(3) Decimal('-1.85037170771E-16') </code></pre> Here we might expect <code>(2.0 - 1.1) / 0.3 == 3.0</code>, but there is a small non-zero difference. However, if you do the computation with normal floating point numbers, then you do get zero: <pre class="prettyprint"><code>>>> (2 - 1.1) / 0.3 - 3 0.0 >>> bool((2 - 1.1) / 0.3 - 3) False </code></pre> The result is rounded somewhere along the way since 1.85e-16 is non-zero: <pre class="prettyprint"><code>>>> bool(-1.85037170771E-16) True </code></pre> I'm unsure exactly where this rounding takes place. As for the loop termination in general, then there's one clue I can offer: for floats less than 253, IEEE 754 can represent all integers: <pre class="prettyprint"><code>>>> 2.0**53 9007199254740992.0 >>> 2.0**53 + 1 9007199254740992.0 >>> 2.0**53 + 2 9007199254740994.0 </code></pre> The space between representable numbers is 2 from 253 to 254, as shown above. But if your <code>i</code> is an integer less than 253, then <code>i - 1</code> will also be a representable integer and you will eventually hit <code>0.0</code>, which is considered false in Python.

Floating point arithmetics: Possible unsafe reliance on specific comparison?

The following python code calculates the number of iterations to do stuff based on some variables.

  # a - b - c is always a multiple of d.
  i = (a - b - c) / d
  while i:
    # do stuff
    i -= 1

The variables will all be of the same type, that is only ints or floats or whatever. My concern is whether it will work correctly if the values are floats. I know enough to always consider the pitfalls of relying on exact float values. But I can't tell if the above is dangerous or not. I can use i = int(round((a - b - c) / d)), but I am curious as to understand floats better.

It all comes down to the following: a - b - c is an exact multiple of d. So I am relying on (a-b-c)/d to become a value i that I can subtract 1 from and get the expected number of iterations in the while loop, with the implied assumption that i == 0 becomes true. That is, can calculated multiples like this be decremented by 1 to reach exactly 0?

I would like to not only know if it is unsafe, but more importantly, what do I need to understand about floating point to resolve a question like this? If someone knows decisively whether this is safe or not, would it be possible to explain how so?

What is the main problem with floating-point representation?

Accuracy in floating point representation is governed by number of significant bits, whereas range is limited by exponent. Not all real numbers can exactly be represented in floating point format.

Why should we never compare floating-point values by using exact equality comparison?

Comparing for equality Floating point math is not exact. Simple values like 0.1 cannot be precisely represented using binary floating point numbers, and the limited precision of floating point numbers means that slight changes in the order of operations or the precision of intermediates can change the result.

What are the limitations of floating-point representation?

Floating-point numbers suffer from a loss of precision when represented with a fixed number of bits (e.g., 32-bit or 64-bit). This is because there is an infinite amount of real numbers, even within a small range like 0.0 to 0.1.

Why is floating point arithmetic inaccurate?

Because often-times, they are approximating rationals that cannot be represented finitely in base 2 (the digits repeat), and in general they are approximating real (possibly irrational) numbers which may not be representable in finitely many digits in any base.

You can use the decimal module to get an idea of what "hides" between a floating point number such as 0.3:

>>> from decimal import Decimal
>>> Decimal(0.3)
Decimal('0.299999999999999988897769753748434595763683319091796875')

Note that Python 2.7 changed how floating point numbers are written (how repr(f) works) so that it now shows the shortest string that will give the same floating point number if you do float(s). This means that repr(0.3) == '0.3' in Python 2.7, but repr(0.3) == '0.29999999999999999' in earlier versions. I'm mentioning this since it can confuse things further when you really want to see what's behind the numbers.

Using the decimal module, we can see the error in a computation with floats:

>>> (Decimal(2.0) - Decimal(1.1)) / Decimal(0.3) - Decimal(3) 
Decimal('-1.85037170771E-16')

Here we might expect (2.0 - 1.1) / 0.3 == 3.0, but there is a small non-zero difference. However, if you do the computation with normal floating point numbers, then you do get zero:

>>> (2 - 1.1) / 0.3 - 3
0.0
>>> bool((2 - 1.1) / 0.3 - 3)
False

The result is rounded somewhere along the way since 1.85e-16 is non-zero:

>>> bool(-1.85037170771E-16)
True

I'm unsure exactly where this rounding takes place.

As for the loop termination in general, then there's one clue I can offer: for floats less than 2⁵³, IEEE 754 can represent all integers:

>>> 2.0**53    
9007199254740992.0
>>> 2.0**53 + 1
9007199254740992.0
>>> 2.0**53 + 2
9007199254740994.0

The space between representable numbers is 2 from 2⁵³ to 2⁵⁴, as shown above. But if your i is an integer less than 2⁵³, then i - 1 will also be a representable integer and you will eventually hit 0.0, which is considered false in Python.

I will give you a language-agnostic answer (I don't really know Python).

There are multiple potential problems in your code. Firstly, this:

(a - b - c)

If a is (for example) 10⁹, and b and c are both 1, then the answer will be 10⁹, not 10⁹-2 (I'm assuming single-precision float here).

Then there's this:

i = (a - b - c) / d

If numerator and denominator are numbers that can't be exactly represented in floating-point (e.g. 0.3 and 0.1), then the result might not be an exact integer (it might be 3.0000001 instead of 3). Therefore, your loop will never terminate.

Then there's this:

i -= 1

Similarly to above, if i is currently 10⁹, then the result of this operation will still be 10⁹, so your loop will never terminate.

Therefore, you should strongly consider performing all the calculations in integer arithmetic.

Floating point arithmetics: Possible unsafe reliance on specific comparison?

Tags:

python

language-agnostic

floating-point

porgarmingduod

People also ask

2 Answers

Martin Geisler

Oliver Charlesworth

Recent Activity

Donate For Us

Floating point arithmetics: Possible unsafe reliance on specific comparison?

Tags:

python

language-agnostic

floating-point

porgarmingduod

People also ask

2 Answers

Martin Geisler

Oliver Charlesworth

Related questions

Recent Activity

Donate For Us