python: class vs tuple huge memory overhead (?)

Tags: python, list, data-structures, class, tuples

I'm storing a lot of complex data in tuples/lists, but would prefer to use small wrapper classes to make the data structures easier to understand, e.g.

class Person:
    def __init__(self, first, last):
        self.first = first
        self.last = last

p = Person('foo', 'bar')
print(p.last)
...

would be preferable over

p = ['foo', 'bar']
print(p[1])
...

However, there seems to be a horrible memory overhead:

l = [Person('foo', 'bar') for i in range(10000000)]
# ipython now takes 1.7 GB RAM

and

del l
l = [('foo', 'bar') for i in range(10000000)]
# now just 118 MB RAM

Why? Is there any obvious alternative solution that I didn't think of?

Thanks!

(I know, in this example the 'wrapper' class looks silly. But when the data becomes more complex and nested, it is more useful)

asked Jul 15 '17 22:07

seb314

1 Answer

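As others have said in their answers, you'll have to generate different objects for the comparison to make sense. In the question, ('foo', 'bar') is a constant, so the list ends up holding 10 million references to a single tuple, while Person('foo', 'bar') creates a new object on every iteration.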
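To check that the question's tuple list really holds one shared object, a minimal sketch can count distinct object identities in each list. The variable names are illustrative, and the reuse of the constant tuple is a CPython implementation detail rather than a language guarantee.

```python
class Person:
    def __init__(self, first, last):
        self.first = first
        self.last = last

n = 1000  # small on purpose; the effect does not depend on the size

# Same literal every iteration: CPython loads one pre-built constant tuple.
same_tuples = [('foo', 'bar') for _ in range(n)]
# Tuple built from the loop variable: a new tuple per iteration.
distinct_tuples = [(i, i) for i in range(n)]
# Calling a class always creates a new instance.
people = [Person('foo', 'bar') for _ in range(n)]

unique_same = len({id(t) for t in same_tuples})
unique_distinct = len({id(t) for t in distinct_tuples})
unique_people = len({id(p) for p in people})

print(unique_same)      # 1 on CPython
print(unique_distinct)  # 1000
print(unique_people)    # 1000
```

So the 118 MB figure is mostly the list's own array of references, not 10 million tuples.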

So, let's compare some approaches.

`tuple`

l = [(i, i) for i in range(10000000)]
# memory taken by Python3: 1.0 GB

`class Person`

class Person:
    def __init__(self, first, last):
        self.first = first
        self.last = last

l = [Person(i, i) for i in range(10000000)]
# memory: 2.0 GB

`namedtuple` (`tuple` + `__slots__`)

from collections import namedtuple
Person = namedtuple('Person', 'first last')

l = [Person(i, i) for i in range(10000000)]
# memory: 1.1 GB

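namedtuple is basically a class that extends tuple and sets an empty __slots__, so instances carry no per-instance __dict__. The field values live in the tuple itself; namedtuple only adds field getters and some other helper methods (on Python versions before 3.7 you can see the exact code generated by passing verbose=True; the parameter was removed in 3.7).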
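A short sketch of what that means in practice, using only the documented `collections.namedtuple` API (the sample values are illustrative):

```python
from collections import namedtuple

Person = namedtuple('Person', 'first last')
p = Person('foo', 'bar')

print(isinstance(p, tuple))     # True: a namedtuple instance is a tuple
print(Person.__slots__)         # (): no extra per-instance storage
print(hasattr(p, '__dict__'))   # False on current Python 3 versions

# Fields are readable by name or by position.
print(p.last, p[1])             # bar bar

# Helper methods added on top of plain tuple behaviour.
as_mapping = p._asdict()            # field name -> value mapping
renamed = p._replace(last='baz')    # new instance; p itself is immutable
print(renamed)
```

Because the instance is still a tuple, it unpacks and compares like one, which is why its footprint stays close to the plain tuple measurement.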

`class Person` + `__slots__`

class Person:
    __slots__ = ['first', 'last']
    def __init__(self, first, last):
        self.first = first
        self.last = last

l = [Person(i, i) for i in range(10000000)]
# memory: 0.9 GB

This is a trimmed-down version of namedtuple above. A clear winner, even better than pure tuples.
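The figures above were read off the process's total memory. As a rough way to reproduce the comparison at a smaller scale, here is a sketch using the standard-library `tracemalloc` module. The `measure` helper and the class names `PlainPerson`, `SlotsPerson` and `TuplePerson` are made up for this example, and the absolute numbers will differ between Python versions and platforms.

```python
import tracemalloc
from collections import namedtuple

class PlainPerson:
    def __init__(self, first, last):
        self.first = first
        self.last = last

class SlotsPerson:
    __slots__ = ['first', 'last']
    def __init__(self, first, last):
        self.first = first
        self.last = last

TuplePerson = namedtuple('TuplePerson', 'first last')

def measure(factory, n=100_000):
    """Return bytes still allocated after building a list of n objects."""
    tracemalloc.start()
    data = [factory(i, i) for i in range(n)]
    current, _peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    del data  # release the objects before the next measurement
    return current

results = {
    'tuple': measure(lambda a, b: (a, b)),
    'class': measure(PlainPerson),
    'namedtuple': measure(TuplePerson),
    'class + __slots__': measure(SlotsPerson),
}

for name, size in results.items():
    print(f'{name:>18}: {size / 1e6:.1f} MB')
```

The trade-off is that a `__slots__` class like this one only accepts the attributes it declares, so it suits fixed record shapes like `Person`.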

answered Oct 04 '22 09:10

randomir
