Python: Is there a way to keep an automatic conversion from int to long int from happening?

Python is more strongly typed than many other scripting languages. For example, in Perl:

perl -E '$c=5; $d="6"; say $c+$d'   #prints 11

But in Python:

>>> c="6"
>>> d=5
>>> print c+d
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: cannot concatenate 'str' and 'int' objects

Perl will inspect a string and convert it to a number, so the + - / * ** operators work as you would expect on numbers. PHP is similar.

Python uses + to concatenate strings, so the attempted operation c+d fails because c is a string and d an int. Python has a stronger sense of numeric types than Perl does. OK -- I can deal with that.
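
If you want the Perl result, the conversion just has to be explicit. Continuing the session above:

>>> print int(c) + d
11
>>> print c + str(d)
65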

But consider:

>>> from sys import maxint
>>> type(maxint)
<type 'int'>
>>> print maxint
9223372036854775807
>>> type(maxint+2)
<type 'long'>
>>> print maxint+2
9223372036854775809
>>> type((maxint+2)+maxint)
<type 'long'>
>>> print ((maxint+2)+maxint)
18446744073709551616

Now Python will auto-promote from an int, which in this case is a 64-bit value (OS X, Python 2.6.1), to a Python long int, which has arbitrary precision. Even though the types are not the same, they are similar, and Python allows the usual numeric operators to be used between them. Usually this is helpful; it smooths over the differences between 32-bit and 64-bit platforms, for example.

The conversion from int to long is one-way:

>>> type((maxint+2)-2)
<type 'long'>

Once the conversion is made, all operations on that value are done in arbitrary precision. Arbitrary-precision operations are orders of magnitude slower than native int operations. In a script I am working on, some runs were snappy while others stretched into hours because of this. Consider:

>>> print maxint**maxint        # execution so long it is essentially a crash

So my question: Is there a way to defeat or not allow the auto-promotion of a Python int to a Python long?

Edit, follow-up:

I received several comments of the form 'why on earth would you want C-style overflow behavior?' The issue was that this particular piece of code worked OK on 32 bits in C and in Perl (with use integer), relying on C's overflow behavior. There was a failed attempt to port this code to Python, and Python's different overflow behavior turned out to be part of the problem. The code had idioms from all of these (C, Perl, some Python) mixed in, along with their associated comments, so it was challenging.

Essentially, the image analysis being done is a disk-based high-pass filter for similar-image comparison. Part of the high-pass filter does an integer multiplication of two large polynomials. The overflow was essentially "don't-care, it's big..." logic, so the result was as intended with C-based overflow. Using Horner's rule in O(n²) time was therefore a waste, since the larger polynomials would just come out "big" -- a rough-justice form of carot-top's saturation arithmetic.
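
For reference, one way to emulate C's unsigned 64-bit wraparound in Python is to mask after each operation. A minimal sketch (add64 and mul64 are hypothetical helpers; Python 2 may still type the results as long, but they never grow past 64 bits, so each operation stays cheap):

MASK64 = 0xFFFFFFFFFFFFFFFF    # 2**64 - 1

def add64(a, b):
    # Wrap like unsigned 64-bit C arithmetic.
    return (a + b) & MASK64

def mul64(a, b):
    # Wrap like unsigned 64-bit C arithmetic.
    return (a * b) & MASK64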

Changing the loop-based polynomial multiplication to an FFT-based multiply is probably significantly faster: FFT-based multiplication runs in O(n log n) time versus O(n²) for Horner's rule. Going from disk-based to in-memory will also speed things up. The images are not terribly big, but the original code was written at a time when they were considered "huge!!!" The code owner is not quite ready to trash his beloved code, so we'll see. The 'right answer' for him is probably to just keep Perl or C if he wants that code.
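
A rough sketch of that idea with numpy (poly_mul_fft is a hypothetical helper, not the original code; it assumes the coefficients stay small enough that double-precision rounding error can be removed by rounding to the nearest integer):

import numpy as np

def poly_mul_fft(a, b):
    # Multiply two integer polynomials (coefficient lists, lowest
    # degree first) in O(n log n) time via the FFT.
    n = len(a) + len(b) - 1
    size = 1
    while size < n:
        size *= 2
    prod = np.fft.irfft(np.fft.rfft(a, size) * np.fft.rfft(b, size), size)
    # Round back to integers; safe while coefficients stay well below 2**52.
    return [int(round(x)) for x in prod[:n]]

For example, poly_mul_fft([1, 2], [3, 4]) returns [3, 10, 8], the coefficients of (1 + 2x)(3 + 4x).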

Thanks for the answers. I did not know about Python's decimal module, and it seems the closest to what I was asking for -- even though there are other issues to be solved in this case!

asked Dec 06 '10 by dawg


2 Answers

So you want to throw out the One True Way and go retro on overflows. Silly you.

There is no good upside to the C / C++ / C# / Java style of overflow. It does not reliably raise an error condition: in C and C++, signed integer overflow is undefined behavior (only unsigned arithmetic is defined to wrap modulo 2^n), and it is a known security risk. Why do you want this?

The Python method of seamlessly overflowing to a long is the better way. I believe this is the same behavior adopted by Perl 6.

You can use the decimal module to get finite, trappable overflows:

>>> from decimal import *
>>> from sys import maxint
>>> getcontext()
Context(prec=28, rounding=ROUND_HALF_EVEN, Emin=-999999999, Emax=999999999, capitals=1,
flags=[], traps=[DivisionByZero, Overflow, InvalidOperation])

>>> d=Decimal(maxint)
>>> d
Decimal('9223372036854775807')
>>> e=Decimal(maxint)
>>> f=d**e
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/System/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/decimal.py", line 2225, in __pow__
    ans = ans._fix(context)
  File "/System/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/decimal.py", line 1589, in _fix
    return context._raise_error(Overflow, 'above Emax', self._sign)
  File "/System/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/decimal.py", line 3680, in _raise_error
    raise error(explanation)
decimal.Overflow: above Emax

You can set your precision and boundary conditions with Decimal contexts, and the overflow is trapped almost immediately. You can choose what to trap for, and you can set your own max and min. Really -- how does it get better than this? (I don't know about relative speed to be honest; I suspect it is faster than numpy but obviously slower than native ints...)
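
For example, continuing the session above, shrinking Emax makes the trap trip on the first multiply (exact traceback abbreviated):

>>> setcontext(Context(prec=20, Emax=18, traps=[Overflow]))
>>> Decimal(maxint) * Decimal(maxint)
Traceback (most recent call last):
  ...
decimal.Overflow: above Emax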

For your specific image-processing issue, this sounds like a natural application for some form of saturation arithmetic. You might also consider, if you are hitting overflows in 32-bit arithmetic, checking the operands along the way in the obvious cases: pow, **, *. Or you could use overloaded operators and check for the conditions you don't want:
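
A minimal sketch of that overloaded-operator idea (CheckedInt is a hypothetical class, not a library API): subclass int and raise as soon as a result leaves the native range, rather than letting Python promote it:

import sys

class CheckedInt(int):
    # Python 2: raise instead of silently promoting to long.
    def _check(self, value):
        if not (-sys.maxint - 1) <= value <= sys.maxint:
            raise OverflowError('out of native int range: %r' % value)
        return CheckedInt(value)

    def __add__(self, other):
        return self._check(int(self) + int(other))

    def __mul__(self, other):
        return self._check(int(self) * int(other))

    def __pow__(self, other):
        return self._check(int(self) ** int(other))

With that, CheckedInt(sys.maxint) + 2 raises OverflowError instead of quietly returning a long.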

If Decimal, saturation, or overloaded operators don't work -- you can write an extension. Heaven help you if you want to throw out the Python way of overflow to go retro...

answered Oct 05 '22 by the wolf

If you want arithmetic overflows to wrap within, say, 32 bits, you could use numpy.uint32.

That gives you a warning when an overflow occurs.

>>> import numpy
>>> numpy.uint32(2**32-3) + numpy.uint32(5)
Warning: overflow encountered in ulong_scalars
2
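
If a warning is too easy to miss, numpy can raise instead; note that it reports scalar integer overflow through its floating-point error machinery, so the exception is FloatingPointError (exact messages vary by version):

>>> old = numpy.seterr(over='raise')
>>> numpy.uint32(2**32-3) + numpy.uint32(5)
Traceback (most recent call last):
  ...
FloatingPointError: overflow encountered in ulong_scalars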

I tested its speed, though:

>\python26\python.exe -m timeit "2**16 + 2**2"
1000000 loops, best of 3: 0.118 usec per loop

>\python26\python.exe -m timeit "2**67 + 2**65"
1000000 loops, best of 3: 0.234 usec per loop

>\python26\python.exe -m timeit -s "import numpy; numpy.seterr('ignore')" "numpy.uint32(2)**numpy.uint32(67) + numpy.uint32(2)**numpy.uint32(65)"
10000 loops, best of 3: 34.7 usec per loop

It's not looking good for speed.

answered Oct 05 '22 by Craig McQueen