Why does Python handle '1 is 1**2' differently from '1000 is 10**3'?

Tags: python, semantics, reference, python-internals

Inspired by this question about caching small integers and strings, I discovered the following behavior, which I don't understand.

    >>> 1000 is 10**3
    False

I thought I understood this behavior: 1000 is too big to be cached, so 1000 and 10**3 point to two different objects. But I had it wrong:

    >>> 1000 is 1000
    True

So, maybe Python treats calculations differently from 'normal' integers. But that assumption is also not correct:

    >>> 1 is 1**2
    True

How can this behavior be explained?

483

asked Feb 19 '14 12:02

OrangeTux

1 Answer

There are two separate things going on here: Python stores int literals (and other literals) as constants alongside the compiled bytecode, and small integer objects are cached as singletons.

When you run 1000 is 1000, only one such constant is stored and reused. You are really looking at the same object:

    >>> import dis
    >>> compile('1000 is 1000', '<stdin>', 'eval').co_consts
    (1000,)
    >>> dis.dis(compile('1000 is 1000', '<stdin>', 'eval'))
      1           0 LOAD_CONST               0 (1000)
                  3 LOAD_CONST               0 (1000)
                  6 COMPARE_OP               8 (is)
                  9 RETURN_VALUE

Here both LOAD_CONST opcodes refer to the constant at index 0; you can see the stored constants in the .co_consts attribute of the code object.

Compare this to the 1000 is 10 ** 3 case:

    >>> compile('1000 is 10**3', '<stdin>', 'eval').co_consts
    (1000, 10, 3, 1000)
    >>> dis.dis(compile('1000 is 10**3', '<stdin>', 'eval'))
      1           0 LOAD_CONST               0 (1000)
                  3 LOAD_CONST               3 (1000)
                  6 COMPARE_OP               8 (is)
                  9 RETURN_VALUE

There is a peephole optimization that pre-computes expressions on constants at compile time. It has replaced 10 ** 3 with 1000, but it doesn't re-use pre-existing constants. As a result, the two LOAD_CONST opcodes load two different int objects, stored at index 0 and index 3.
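To see what your own interpreter stores, the constants can be inspected directly. The following is a minimal sketch; the helper name `int_constants` is made up for illustration and is not part of any library. The output depends on the interpreter version: constant folding and constant merging have changed across CPython releases, so a newer interpreter may store a single 1000 for the folded expression, and may keep very small integers out of `co_consts` altogether.

```python
import warnings


def int_constants(source):
    """Return the int constants stored on the code object compiled from an expression.

    Hypothetical helper for illustration only.
    """
    with warnings.catch_warnings():
        # Newer Python versions warn about using "is" with a literal.
        warnings.simplefilter("ignore", SyntaxWarning)
        code = compile(source, "<example>", "eval")
    # bool is a subclass of int, so filter it out explicitly.
    return [c for c in code.co_consts
            if isinstance(c, int) and not isinstance(c, bool)]


for source in ("1000 is 1000", "1000 is 10**3", "1 is 1**2"):
    print(source, "->", int_constants(source))
```

If the list for `1000 is 10**3` contains 1000 twice, the two operands are separate objects, as in the disassembly above; if it appears once, both operands load the same constant.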

Separately, there is an optimisation where small integers are interned: only one copy of the 1 object is ever created during the lifetime of a Python program. This applies to all integers from -5 to 256 inclusive.

Thus, for the 1 is 1**2 case, both operands are the same singleton int object from the internal cache. This is a CPython implementation detail.
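The small-integer cache can be observed without the compiler getting involved by building the integers at runtime. This is a sketch under the assumption that you are running CPython; the helper `make_int` is hypothetical and exists only to keep the values out of the constants table. Other implementations are free to behave differently.

```python
import platform


def make_int(text):
    """Build an int at runtime so it cannot be folded or shared as a compile-time constant."""
    return int(text)


small_a, small_b = make_int("256"), make_int("256")
large_a, large_b = make_int("257"), make_int("257")

print("implementation:", platform.python_implementation())
# Columns: value, equal by value (==), same object (is)
print(256, small_a == small_b, small_a is small_b)
print(257, large_a == large_b, large_a is large_b)
```

In both rows == reports equality; only the identity result varies. On CPython, 256 falls inside the cached range and 257 falls outside it.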

The moral of this story is that you should never use is when you really wanted to compare by value. Use == for integers, always.

101

answered Oct 05 '22 23:10

Martijn Pieters
