I was playing around with sys's getsizeof() and found that False (or 0) consists of fewer bytes than True (or 1). Why is that?
import sys
print("Zero: " + str(sys.getsizeof(0)))
print("One: " + str(sys.getsizeof(1)))
print("False: " + str(sys.getsizeof(False)))
print("True: " + str(sys.getsizeof(True)))
# Prints:
# Zero: 24
# One: 28
# False: 24
# True: 28
In fact, other small integers (including ones with more than one digit) are also 28 bytes.
for n in range(0, 12):
    print(str(n) + ": " + str(sys.getsizeof(n)))
# Prints:
# 0: 24
# 1: 28
# 2: 28
# 3: 28
# 4: 28
# 5: 28
# 6: 28
# 7: 28
# 8: 28
# 9: 28
# 10: 28
# 11: 28
Even more: sys.getsizeof(999999999) is also 28 bytes! sys.getsizeof(9999999999), however, is 32.
So what's going on? I assume that the booleans True and False are internally converted to 0 and 1 respectively, but why does zero differ in size from the other small integers?
Side question: is this specific to how Python (3) represents these values, or is this generally how integers are represented by the operating system?
Zero is used to represent false, and one is used to represent true. When interpreting a value, zero is interpreted as false and anything non-zero is interpreted as true. To make life easier, C programmers typically define the terms "true" and "false" to have the values 1 and 0 respectively.
Python assigns boolean values to values of other types: for numeric types like integers and floats, zero is false and non-zero values are true; for strings, the empty string is false and non-empty strings are true.
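For example, a quick check in any Python 3 interpreter (nothing version-specific assumed):
print(bool(0), bool(42), bool(0.0))  # Prints: False True False
print(bool(""), bool("hello"))       # Prints: False True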
In Python, True and False are equivalent to 1 and 0. Use int() on a boolean to get its integer value: int(True) returns 1 and int(False) returns 0.
In Python 3.x, True and False are keywords and will always be equal to 1 and 0.
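You can verify this for yourself; bool is simply a subclass of int:
print(issubclass(bool, int))  # Prints: True
print(True == 1, False == 0)  # Prints: True True
print(int(True), int(False))  # Prints: 1 0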
Remember that Python int values are of arbitrary size. How does that work?
Well, in CPython,¹ an int is represented by a PyLongObject, which has an array of 4-byte chunks,² each holding 30 bits³ worth of the number.
0 takes no chunks at all.
1 through (1<<30)-1 takes 1 chunk.
1<<30 through (1<<60)-1 takes 2 chunks.
And so on.
This is slightly oversimplified; for full details, see longintrepr.h in the source.
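If that model is right, the reported size should jump by 4 bytes each time a value needs another 30-bit chunk. A quick check (the byte counts in the comment assume the same 64-bit CPython 3 build as above, where sys.getsizeof(0) is 24; other builds or versions may report different numbers):
import sys
for value in (0, 1, (1 << 30) - 1, 1 << 30, (1 << 60) - 1, 1 << 60):
    print(value, sys.getsizeof(value))
# Sizes printed: 24, 28, 28, 32, 32, 36
# i.e. 24 bytes of object header plus 4 bytes per 30-bit chunk.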
In Python 2, there are two separate types, called int and long. An int is represented by a C 32-bit signed integer⁴ embedded directly in the header, instead of an array of chunks. A long is like a Python 3 int.
If you do the same test with 0L, 1L, etc., to explicitly ask for long values, you will get the same results as in Python 3. But without the L suffix, any literal that fits in 32 bits gives you an int, and only literals that are too big give you longs.⁵ (This means that (1<<31)-1 is an int, but 1<<31 is a 2-chunk long.)
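As a sketch of what that looks like (Python 2.7 syntax only; it won't run under Python 3, and the exact sizes and the int/long cutoff depend on the build):
import sys
print type(1), type(1L)                      # <type 'int'> <type 'long'>
print sys.getsizeof(1), sys.getsizeof(1L)    # e.g. 24 vs 28 on a 64-bit build
print type((1 << 31) - 1), type(1 << 31)     # the second overflows into long
                                             # on builds where a C long is 32 bits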
1. In a different implementation, this might not be true. IIRC, Jython does roughly the same thing as CPython, but IronPython uses a C# "bignum" implementation.
2. Why 30 bits instead of 32? Mainly because the implementation of pow and ** can be simpler and faster if it can assume that the number of bits in two "digits" is divisible by 10.
3. It uses the C "struct hack". Technically, a PyLongObject is 28 bytes, but nobody ever allocates a PyLongObject; they malloc 24, 28, 32, 36, etc. bytes and then cast to PyLongObject *.
4. In fact, a Python int is a C long, just to make things confusing. So the C API is full of things like PyInt_FromLong, where the long means "32-bit int", and PyLong_FromSize_t, where the long means "bignum".
5. Early versions of Python 2.x didn't integrate int and long as nicely, but hopefully nobody has to worry about those anymore.