Lets say that I have a graph and want to see if <code>b in N[a]</code>. Which is the faster implementation and why? <pre class="prettyprint"><code>a, b = range(2) N = [set([b]), set([a,b])] </code></pre> OR <pre class="prettyprint"><code>N= [[b],[a,b]] </code></pre> This is obviously oversimplified, but imagine that the graph becomes really dense.

Membership testing in a set is vastly faster, especially for large sets. That is because the set uses a hash function to map to a bucket. Since Python implementations automatically resize that hash table, the speed can be constant (<code>O(1)</code>) no matter the size of the set (assuming the hash function is sufficiently good). In contrast, to evaluate whether an object is a member of a list, Python has to compare every single member for equality, i.e. the test is <code>O(n)</code>.

It all depends on what you're trying to accomplish. Using your example verbatim, it's faster to use lists, as you don't have to go through the overhead of creating the sets: <pre class="prettyprint"><code>import timeit def use_sets(a, b): return [set([b]), set([a, b])] def use_lists(a, b): return [[b], [a, b]] t=timeit.Timer("use_sets(a, b)", """from __main__ import use_sets a, b = range(2)""") print "use_sets()", t.timeit(number=1000000) t=timeit.Timer("use_lists(a, b)", """from __main__ import use_lists a, b = range(2)""") print "use_lists()", t.timeit(number=1000000) </code></pre> Produces: <pre class="prettyprint"><code>use_sets() 1.57522511482 use_lists() 0.783344984055 </code></pre> However, for reasons already mentioned here, you benefit from using sets when you are searching large sets. It's impossible to tell by your example where that inflection point is for you and whether or not you'll see the benefit. I suggest you test it both ways and go with whatever is faster for your specific use-case.

Which is faster and why? Set or List?

Tags:

Lets say that I have a graph and want to see if b in N[a]. Which is the faster implementation and why?

a, b = range(2)
N = [set([b]), set([a,b])]

N= [[b],[a,b]]

This is obviously oversimplified, but imagine that the graph becomes really dense.

236

asked Oct 10 '11 18:10

locoboy

3 Answers

Membership testing in a set is vastly faster, especially for large sets. That is because the set uses a hash function to map to a bucket. Since Python implementations automatically resize that hash table, the speed can be constant (O(1)) no matter the size of the set (assuming the hash function is sufficiently good).

In contrast, to evaluate whether an object is a member of a list, Python has to compare every single member for equality, i.e. the test is O(n).

answered Dec 29 '22 00:12

phihag

Set ( I mean a hash based set like HashSet) is much faster than List to lookup for a value. List has to go sequentially to find out if the value exists. HashSet can directly jump and locate the bucket and look up for a value almost in a constant time.

answered Dec 28 '22 22:12

java_mouse

It all depends on what you're trying to accomplish. Using your example verbatim, it's faster to use lists, as you don't have to go through the overhead of creating the sets:

import timeit

def use_sets(a, b):
    return [set([b]), set([a, b])]

def use_lists(a, b):
    return [[b], [a, b]]

t=timeit.Timer("use_sets(a, b)", """from __main__ import use_sets
a, b = range(2)""")
print "use_sets()", t.timeit(number=1000000)

t=timeit.Timer("use_lists(a, b)", """from __main__ import use_lists
a, b = range(2)""")
print "use_lists()", t.timeit(number=1000000)

Produces:

use_sets() 1.57522511482
use_lists() 0.783344984055

However, for reasons already mentioned here, you benefit from using sets when you are searching large sets. It's impossible to tell by your example where that inflection point is for you and whether or not you'll see the benefit.

I suggest you test it both ways and go with whatever is faster for your specific use-case.

answered Dec 28 '22 23:12

Austin Marshall

Related questions
                            
                                Troubleshooting "Delimiter must not be alphanumeric or backslash" error when changing ereg() to preg_match() [duplicate]
                            
                                How to force Share Intent to open a specific app?
                            
                                Convert unsigned int to signed int C
                            
                                Deserializing JSON with Jackson - Why JsonMappingException "No suitable constructor"?
                            
                                Unsupported configuration plain style unsupported in a navigation item
                            
                                Disabling or greying out a DataGridView
                            
                                R remove non-alphanumeric symbols from a string
                            
                                iFrame 100% height causes vertical scrollbar
                            
                                Prevent empty strings in CHARACTER VARYING field
                            
                                CSS3 transitions want to add a colour and fade it away
                            
                                redis vs native sessions
                            
                                Twitter bootstrap - open modal over an already opened modal

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With