Evil in the python decimal / float

I have a large amount of Python code that handles numbers with 4-decimal precision, and I am stuck with Python 2.4 for many reasons. The code does very simplistic math (it's a credit management system that mostly takes or adds credits).

It has intermingled usage of float and Decimal (MySQLdb returns Decimal objects for SQL DECIMAL columns). After several strange bugs surfaced in use, I traced the root cause of all of them to a few places in the code where floats and Decimals are compared.

I got to cases like this:

>>> from decimal import Decimal
>>> max(Decimal('0.06'), 0.6)
Decimal("0.06")

Now my fear is that I might not be able to catch all such cases in the code (a normal programmer will keep writing x > 0 instead of x > Decimal('0.0000'), and that is very hard to avoid).

I have come up with a patch (inspired by improvements to the decimal package in Python 2.7):

import decimal
from decimal import Decimal

def _convert_other(other):
    """Convert other to Decimal.

    Verifies that it's ok to use in an implicit construction.
    """
    if isinstance(other, Decimal):
        return other
    if isinstance(other, (int, long)):
        return Decimal(other)
    # Our small patch begins
    if isinstance(other, float):
        return Decimal(str(other))
    # Our small patch ends
    return NotImplemented

decimal._convert_other = _convert_other

I just apply this in a library that loads very early, and it changes the decimal package's behavior by converting floats to Decimal before comparisons (so we never fall through to Python's default object-to-object comparison).
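
For example, after importing the patch module early (decimal_patch is just a placeholder name for wherever the snippet above lives), the comparison from the start of the question behaves the way I would expect:

>>> import decimal_patch   # hypothetical module containing the patch above
>>> from decimal import Decimal
>>> Decimal('0.06') < 0.6
True
>>> Decimal('0.06') > 0.6
False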

I specifically used "str" instead of "repr", as it smooths over some of float's representation artifacts. E.g.:

>>> Decimal(str(0.6))
Decimal("0.6")
>>> Decimal(repr(0.6))
Decimal("0.59999999999999998")

Now my question is: am I missing anything here? Is this fairly safe, or am I breaking something? (I am guessing the authors of the package had very strong reasons to avoid floats so thoroughly.)

asked Nov 15 '10 by Ehsan Foroughi


2 Answers

I think you want raise NotImplementedError() instead of return NotImplemented, to start.

What you're doing is called "monkey patching", and it is OK to do, so long as you know what you're doing, are aware of the fallout, and are OK with that fallout. Generally you limit this to fixing a bug, or some other change where you know your alteration of the behavior is still correct and backwards compatible.

In this case, because you're patching a class, you can change behavior outside of the cases where you use it. If another library uses decimal, and somehow relies on the default behavior, it might cause subtle bugs. The trouble is you don't really know unless you audit all your code, including any dependencies, and find all the call sites.

Basically - do it at your own risk.

Personally I find it more reassuring to fix all my code, add tests, and make it harder to do the wrong thing (e.g., use wrapper classes or helper functions). Another approach would be to instrument your code with your patch to find all the call sites, then go back and fix them.
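
As a rough sketch of the helper-function idea (the names to_decimal and credit_gt are made up for this example, and it reuses the same str()-based conversion the question's patch does):

from decimal import Decimal

def to_decimal(value):
    """Coerce Decimals, ints, longs, floats and numeric strings to Decimal."""
    if isinstance(value, Decimal):
        return value
    if isinstance(value, float):
        # Same str()-based rounding the question's patch relies on.
        return Decimal(str(value))
    return Decimal(value)

def credit_gt(a, b):
    """Compare two credit amounts as Decimals, never as raw floats."""
    return to_decimal(a) > to_decimal(b)

Once every comparison of credit amounts goes through a helper like this, the remaining bare float comparisons are much easier to grep for and fix.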

Edit - I guess I should add that the probable reason they avoided floats is that floats can't accurately represent all decimal numbers, which is important if you're dealing with money.

answered Sep 28 '22 by Richard Levasseur


There are very good reasons to avoid floats. With floats you cannot reliably do comparisons such as ==, >, <, etc., because of floating-point noise. Every floating-point operation can accumulate a bit more of it: it starts as very small digits at the very end, e.g. 1.000...002, but it can eventually grow into something like 1.0000000453436.
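
For a quick illustration of that accumulation (the exact digits will vary with the Python version and platform, but the pattern is the same):

>>> total = 0.0
>>> for _ in range(10):
...     total += 0.1
...
>>> total == 1.0
False
>>> total               # noise at the very end
0.99999999999999989
>>> str(total)          # str() still hides it here, but only up to a point
'1.0'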

Using str() may work for you if you don't do many floating-point computations, but if you do a lot of them, the noise will eventually grow big enough that str() gives you the wrong answer.

In sum, if (1) you don't do that many floating-point computations, or (2) you don't need to do comparisons like ==, >, <, etc., then you might be OK.

If you want to be sure, then remove all floating point code.

answered Sep 28 '22 by gaefan