Python 2: different meaning of the 'in' keyword for sets and lists

Consider this snippet:

class SomeClass(object):

    def __init__(self, someattribute="somevalue"):
        self.someattribute = someattribute

    def __eq__(self, other):
        return self.someattribute == other.someattribute

    def __ne__(self, other):
        return not self.__eq__(other)

list_of_objects = [SomeClass()]
print(SomeClass() in list_of_objects)

set_of_objects = set([SomeClass()])
print(SomeClass() in set_of_objects)

which evaluates to:

True
False

Can anyone explain why the 'in' keyword has a different meaning for sets and lists? I would have expected both to return True, especially when the type being tested has equality methods defined.

253

asked Feb 13 '12 04:02

mskel

2 Answers

The meaning is the same, but the implementation is different. A list examines each element in turn and checks it for equality, so it works for your class. A set first hashes the object being looked up, and if the class doesn't implement __hash__ consistently with __eq__, the set appears not to work.

Your class defines __eq__ but not __hash__, so it won't work properly in sets or as a dictionary key. The rule for __eq__ and __hash__ is that two objects that compare equal must also have equal hashes. By default, objects hash based on their identity (in CPython, a value derived from the memory address). So your two objects, which are equal by your definition, produce different hashes and break that rule.
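As a rough illustration of the broken rule, the sketch below compares equality and hashes for two instances of the question's class. One assumption is added that is not in the original code: the line `__hash__ = object.__hash__`, which restores the identity-based hash by hand so the snippet behaves the same under Python 3 (where a class that defines __eq__ alone gets __hash__ set to None and becomes unhashable).

```python
class SomeClass(object):

    def __init__(self, someattribute="somevalue"):
        self.someattribute = someattribute

    def __eq__(self, other):
        return self.someattribute == other.someattribute

    # Assumption for portability: Python 2 inherits this identity-based
    # hash automatically; Python 3 would set __hash__ to None instead.
    __hash__ = object.__hash__


a = SomeClass()
b = SomeClass()

equal = (a == b)                   # compares someattribute
same_hash = (hash(a) == hash(b))   # compares identity-based hashes
in_list = a in [b]                 # list membership relies on == only
in_set = a in set([b])             # set membership consults the hash first

print(equal)
print(same_hash)
print(in_list)
print(in_set)
```

The two objects compare equal but hash differently, which is why the list finds a match and the set does not.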

If you provide a __hash__ implementation, it will work fine. For your sample code, it could be:

def __hash__(self):
    return hash(self.someattribute)
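Put together with the class from the question, a minimal sketch might look like the following. The variable names list_result, set_result and dict_result are only for illustration, and the sketch assumes someattribute is itself hashable (a string here) and is not changed while the object sits in a set or dict.

```python
class SomeClass(object):

    def __init__(self, someattribute="somevalue"):
        self.someattribute = someattribute

    def __eq__(self, other):
        return self.someattribute == other.someattribute

    def __ne__(self, other):
        return not self.__eq__(other)

    def __hash__(self):
        # Equal objects now share a hash, because both __eq__ and
        # __hash__ are derived from the same attribute.
        return hash(self.someattribute)


list_result = SomeClass() in [SomeClass()]
set_result = SomeClass() in set([SomeClass()])
dict_result = {SomeClass(): "found"}[SomeClass()]

print(list_result)
print(set_result)
print(dict_result)
```

With __hash__ defined this way, the list test, the set test and the dictionary lookup all find the equal object.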

105

answered Oct 20 '22 18:10

Ned Batchelder

In pretty much any hashtable implementation, including Python's, if you override the equality method you must also override the hashing method (in Python, this is __hash__). The in operator for lists just checks equality against every element of the list, while the in operator for sets first hashes the object you are looking for, checks for an object in that slot of the hashtable, and then checks for equality if there is anything in the slot. So, if you override __eq__ without overriding __hash__, you cannot be guaranteed that the in operator for sets will check in the right slot.
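The lookup order described above can be sketched with a toy hash table. This is not how CPython's set is actually implemented (it uses open addressing rather than bucket lists); ToySet, Key and forced_hash are hypothetical names invented here purely to make the slot selection visible.

```python
class ToySet(object):
    """Toy bucket-based set: hash picks the slot, == confirms the match."""

    def __init__(self, n_buckets=8):
        self.buckets = [[] for _ in range(n_buckets)]

    def _bucket(self, item):
        # Step 1: the hash decides which slot to look in.
        return self.buckets[hash(item) % len(self.buckets)]

    def add(self, item):
        bucket = self._bucket(item)
        if item not in bucket:
            bucket.append(item)

    def __contains__(self, item):
        # Step 2: equality is checked only against items in that slot.
        return any(item == candidate for candidate in self._bucket(item))


class Key(object):
    """Hypothetical key whose hash is chosen by the caller."""

    def __init__(self, value, forced_hash):
        self.value = value
        self.forced_hash = forced_hash

    def __eq__(self, other):
        return self.value == other.value

    def __hash__(self):
        return self.forced_hash


toy = ToySet()
toy.add(Key("somevalue", 1))

found_same_slot = Key("somevalue", 1) in toy    # same hash, same slot
found_other_slot = Key("somevalue", 2) in toy   # equal value, other slot

print(found_same_slot)
print(found_other_slot)
```

The second lookup misses even though the keys compare equal, because the differing hash sends the search to a slot that never held the stored key.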

answered Oct 20 '22 18:10

Adam Mihalcin

Tags: python, equality, list, set