In my Python code I have this class:
class _Point2D:
    def __init__(self, x, y):
        self.x = x
        self.y = y    
    def __repr__(self):
        return 'point: (' + str(self.x) + ', ' + str(self.y) + ')' 
And there are two lists, initialPointsList and burnedPointsList:
initialPointsList = []
initialPointsList.append(_Point2D(1, 1))
initialPointsList.append(_Point2D(1, 2))
initialPointsList.append(_Point2D(1, 3))
initialPointsList.append(_Point2D(1, 4))
initialPointsList.append(_Point2D(1, 5))
initialPointsList.append(_Point2D(1, 6))
initialPointsList.append(_Point2D(1, 7))
burnedPointsList = []
burnedPointsList.append(_Point2D(1, 2))
burnedPointsList.append(_Point2D(1, 3))
I want to calculate the difference between initialPointsList and burnedPointsList
I have executed:
result = set(initialPointsList) - set(burnedPointsList)
for item in result:
    print item
And get the following output:
point: (1, 1)
point: (1, 4)
point: (1, 5)
point: (1, 6)
point: (1, 2)
point: (1, 3)
point: (1, 7)
But I expected another result, without burned point coordinates:
point: (1, 1)
point: (1, 4)
point: (1, 5)
point: (1, 6)
point: (1, 7)
What is the best way to do that in Python? What is incorrect with my code ?
If you want this to work correctly, you need to define the __eq__() and __hash__() special methods.  If you define __eq__(), it's usually also a good idea to define __ne__().
__eq__() should return True if its arguments are equivalent (their x and y values are the same).  __ne__() should do the opposite.  It's usually also desirable for __eq__() to do type checking, and return false if the "other" value is not of the same type as self.
__hash__() should return a number.  The number should be the same for two values which compare equal with __eq__(), and it's desirable but not strictly required for it to be different for distinct values.  A good implementation is this:
def __hash__(self):
    return hash((self.x, self.y))
The tuple hashing algorithm will combine the hash values of its elements in a statistically well-behaved way.  You may sometimes see people recommend bitwise XOR (i.e. self.x ^ self.y) here, but that isn't a good idea.  That technique throws away all the bits they have in common, which makes for inferior hashing performance (e.g. it always returns zero if self.x == self.y).
Finally, you need to make sure that hash values don't change after an object has been constructed.  This is most easily accomplished by converting self.x and self.y into read-only properties.
For completeness, here would be the __eq__, __ne__, and __hash__ methods as mentioned in Kevin's answer.
def __eq__(self, other):
    return type(self) is type(other) and self.x == other.x and self.y == other.y
def __ne__(self, other):
    return not self.__eq__(other)
def __hash__(self):
    return hash((self.x, self.y))
I test it by adding these methods to your class and it produces the expected output:
point: (1, 5)
point: (1, 6)
point: (1, 1)
point: (1, 4)
point: (1, 7)
                        If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With