As seen in the session below, 50,000,000 records only take 404 MB of memory. Why? Since one record takes 83 bytes, 50,000,000 records should take about 3958 MB.
>>> import sys
>>> a=[]
>>> for it in range(5*10**7):a.append("miJ8ZNFG9iFqiQQohvyTWwqsij2rJCiZ7v"+str(it))
...
>>> print(sys.getsizeof(a)/1024**2)
404.4306411743164
>>> print(sys.getsizeof("miJ8ZNFG9iFqiQQohvyTWwqsij2rJCiZ7v"))
83
>>> print(83*5*10**7/1024**2)
3957.7484130859375
>>>
When you create a list object, the empty list by itself takes roughly 56-64 bytes of memory (depending on the Python version), and each item adds another 8 bytes to the size of the list on a 64-bit build, because the list stores only references (pointers) to other objects.
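For instance, a quick check on a 64-bit CPython 3 build shows the header-plus-pointers pattern directly (exact byte counts vary between Python versions; the values in the comments are typical):

import sys

print(sys.getsizeof([]))          # empty list: just the header, e.g. 56 bytes
print(sys.getsizeof([None]))      # header + one 8-byte pointer slot, e.g. 64 bytes
print(sys.getsizeof([None] * 3))  # header + three pointer slots, e.g. 80 bytes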
The references are the cheap part; the real cost is the objects they point to. As a comparison, a million small integers could in principle fit in about 8 MB (a million 8-byte values), yet a Python list holding them uses roughly 35 MB of RAM. Why? Because Python integers are objects, and objects have a lot of memory overhead. The same is true of the strings in this question: each one is a full object, far larger than its raw characters.
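For illustration, here is what the per-object cost looks like on a 64-bit CPython 3 build (the exact figures are version-dependent; for ASCII strings the size is roughly a fixed header plus one byte per character):

import sys

print(sys.getsizeof(1))    # a small int object, e.g. 28 bytes, not 8
print(sys.getsizeof(""))   # an empty str object, e.g. 49 bytes
print(sys.getsizeof("miJ8ZNFG9iFqiQQohvyTWwqsij2rJCiZ7v"))  # 49 + 34 ASCII chars = 83 bytes

So the 83 bytes in the question is the size of one string object on its own; the list additionally stores an 8-byte pointer to each of those objects, and none of the string objects are counted by sys.getsizeof(a).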
The list is based on an array. An array is a collection of elements that are ① of the same size and ② located in memory one after another, without gaps. Since elements are the same size and placed contiguously, it is easy to get an array item by index: all we need is the memory address of the very first element (the "head" of the array), and the item's address is simply that head address plus the index times the element size.
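To make that indexing arithmetic concrete, here is a tiny conceptual sketch; the base address is made up for illustration, but CPython's list stores 8-byte pointers in exactly this fashion on a 64-bit build:

ELEMENT_SIZE = 8           # one 8-byte pointer per slot on a 64-bit build
head_address = 0x10_0000   # hypothetical address of the first slot

def slot_address(index):
    # address of slot i = head + i * element size; no searching required
    return head_address + index * ELEMENT_SIZE

print(hex(slot_address(0)))   # 0x100000
print(hex(slot_address(10)))  # 0x100050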
sys.getsizeof only reports the cost of the list itself, not its contents. So you're seeing the cost of storing the list object header plus (a little over) 50M pointers; the list over-allocates slightly as it grows through append, and you're likely on a 64-bit system with eight-byte pointers, so storage for 50M-plus pointers comes to ~400 MB. Getting the true size would require sys.getsizeof to be called for each object, each object's __dict__ (if applicable), etc., recursively, and it won't be 100% accurate since some of the objects (e.g. small ints) are likely shared; this is not a rabbit hole you want to go down.
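That said, if an approximate deep size is still wanted, a minimal sketch could look like the following (the name deep_getsizeof and the set of handled container types are just illustrative; it counts each shared object once and remains an estimate):

import sys

def deep_getsizeof(obj, seen=None):
    # Approximate size of obj plus everything it references.
    # Shared objects are counted once; interpreter-level sharing
    # (small ints, interned strings) still makes this an estimate.
    if seen is None:
        seen = set()
    if id(obj) in seen:
        return 0
    seen.add(id(obj))
    size = sys.getsizeof(obj)
    if isinstance(obj, dict):
        size += sum(deep_getsizeof(k, seen) + deep_getsizeof(v, seen)
                    for k, v in obj.items())
    elif isinstance(obj, (list, tuple, set, frozenset)):
        size += sum(deep_getsizeof(item, seen) for item in obj)
    if hasattr(obj, "__dict__"):
        size += deep_getsizeof(vars(obj), seen)
    return size

a = ["miJ8ZNFG9iFqiQQohvyTWwqsij2rJCiZ7v" + str(it) for it in range(1000)]
print(sys.getsizeof(a))    # list header + pointers only
print(deep_getsizeof(a))   # list + the 1000 string objects it points to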