Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

python: deque vs list performance comparison

In python docs I can see that deque is a special collection highly optimized for poping/adding items from left or right sides. E.g. documentation says:

Deques are a generalization of stacks and queues (the name is pronounced “deck” and is short for “double-ended queue”). Deques support thread-safe, memory efficient appends and pops from either side of the deque with approximately the same O(1) performance in either direction.

Though list objects support similar operations, they are optimized for fast fixed-length operations and incur O(n) memory movement costs for pop(0) and insert(0, v) operations which change both the size and position of the underlying data representation.

I decided to make some comparisons using ipython. Could anyone explain me what I did wrong here:

In [31]: %timeit range(1, 10000).pop(0)  10000 loops, best of 3: 114 us per loop  In [32]: %timeit deque(xrange(1, 10000)).pop()  10000 loops, best of 3: 181 us per loop  In [33]: %timeit deque(range(1, 10000)).pop() 1000 loops, best of 3: 243 us per loop 
like image 679
Oleg Tarasenko Avatar asked May 06 '14 06:05

Oleg Tarasenko


People also ask

Is deque faster than list Python?

Of course, it can be done using lists, but with deques, you already have the interface for that, and it's much faster.

Is deque fast?

Python's collections module provides a class called deque that's specially designed to provide fast and memory-efficient ways to append and pop item from both ends of the underlying data structure.

How is deque different from list in Python?

Deque is a doubly linked list while List is just an array. Randomly accessing an object at index i is O(n) for Deque but O(1) for List . Fast insertions and deletions at the beginning is the biggest advantage of Deque . Fast random reads is the advantage of List .

Is a deque a list?

A deque is a set of linked memory blocks, where more than one element is stored in each memory block. A list is a set of elements dispersed in memory, i.e.: only one element is stored per memory "block".


1 Answers

Could anyone explain me what I did wrong here

Yes, your timing is dominated by the time to create the list or deque. The time to do the pop is insignificant in comparison.

Instead you should isolate the thing you're trying to test (the pop speed) from the setup time:

In [1]: from collections import deque  In [2]: s = list(range(1000))  In [3]: d = deque(s)  In [4]: s_append, s_pop = s.append, s.pop  In [5]: d_append, d_pop = d.append, d.pop  In [6]: %timeit s_pop(); s_append(None) 10000000 loops, best of 3: 115 ns per loop  In [7]: %timeit d_pop(); d_append(None) 10000000 loops, best of 3: 70.5 ns per loop 

That said, the real differences between deques and list in terms of performance are:

  • Deques have O(1) speed for appendleft() and popleft() while lists have O(n) performance for insert(0, value) and pop(0).

  • List append performance is hit and miss because it uses realloc() under the hood. As a result, it tends to have over-optimistic timings in simple code (because the realloc doesn't have to move data) and really slow timings in real code (because fragmentation forces realloc to move all the data). In contrast, deque append performance is consistent because it never reallocs and never moves data.

like image 64
Raymond Hettinger Avatar answered Sep 23 '22 13:09

Raymond Hettinger