Internals of Python list, access and resizing runtimes

Tags:

Is Python's [] a list or an array?
Is the access time of an index O(1) like an array or O(n) like a list?
Is appending/resizing O(1) like a list or O(n) like an array, or is it a hybrid that can manage O(1) for accessing and resizing?

I read here that array access is really slow in Python. However, when I wrote a memoized version of a recursive fibonacci procedure using both a dictionary (Python's dictionary is suppose to be really fast) and a list, they had equal times. Why is this?

Does a Python tuple have faster access times than a python list?

984

asked May 09 '11 03:05

daveeloo

1 Answers

Python's [] is implemented as an array, not a linked list. Although resizing is O(n), appending to it is amortized O(1), because resizes happen very rarely. If you're not familiar with how this works, read this Wikipedia entry on dynamic arrays. Python's list doesn't expand by a factor of 2 each time, it's a bit more complicated than that, but the expansions are still designed to make appending amortized O(1).

Inserting in the middle, however, is always an inefficient O(n), because n items may have to be moved.

Tuples aren't faster than lists - they're just immutable lists under the hood (*).

Regarding your dictionary test: depending on your exact implementation, caching in a list will be faster than with a dict. However, Python's dicts are highly optimized, and especially for small amounts of keys will perform great.

(*) Here's a list's "get item" C function in Python 2.6:

PyObject * PyList_GetItem(PyObject *op, Py_ssize_t i) {     if (!PyList_Check(op)) {         PyErr_BadInternalCall();         return NULL;     }     if (i < 0 || i >= Py_SIZE(op)) {         if (indexerr == NULL)             indexerr = PyString_FromString(                 "list index out of range");         PyErr_SetObject(PyExc_IndexError, indexerr);         return NULL;     }     return ((PyListObject *)op) -> ob_item[i]; }

And this is a tuple's:

PyObject * PyTuple_GetItem(register PyObject *op, register Py_ssize_t i) {     if (!PyTuple_Check(op)) {         PyErr_BadInternalCall();         return NULL;     }     if (i < 0 || i >= Py_SIZE(op)) {         PyErr_SetString(PyExc_IndexError, "tuple index out of range");         return NULL;     }     return ((PyTupleObject *)op) -> ob_item[i]; }

As you can see, they're almost exactly the same. In the end, after some type and bound checking, it's a simple pointer access with an index.

[Reference: Python documentation on Time Complexity for data type operations]

148

answered Sep 25 '22 01:09

Eli Bendersky

Related questions
                            
                                Pytorch: how to add L1 regularizer to activations?
                            
                                aws lambda Unable to import module 'lambda_function': No module named 'requests'
                            
                                How do you override vim options via comments in a python source code file?
                            
                                django content types - how to get model class of content type to create a instance?
                            
                                How to handle urllib's timeout in Python 3?
                            
                                What is a reference cycle in python?
                            
                                class variables is shared across all instances in python? [duplicate]
                            
                                How do I change button size in Python?
                            
                                InvalidRequestError: VARCHAR requires a length on dialect mysql
                            
                                Python OrderedDict iteration
                            
                                Trouble installing private github repository using pip
                            
                                How to make Ipython output a list without line breaks after elements?
                            
                                Overloading Addition, Subtraction, and Multiplication Operators
                            
                                Transpose nested list in python
                            
                                Pandas Correlation Groupby
                            
                                Pandas DataFrame stack multiple column values into single column
                            
                                portaudio.h: No such file or directory
                            
                                Seaborn heatmap not displaying all xticks and yticks
                            
                                What is Jython and is it useful at all? [closed]
                            
                                Python/Erlang: What's the difference between Twisted, Stackless, Greenlet, Eventlet, Coroutines? Are they similar to Erlang processes?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Internals of Python list, access and resizing runtimes

Tags:

python

list

time

internals

space

daveeloo

People also ask

1 Answers

Eli Bendersky

Recent Activity

Donate For Us