Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Python set with the ability to pop a random element

Tags:

I am in need of a Python (2.7) object that functions like a set (fast insertion, deletion, and membership checking) but has the ability to return a random value. Previous questions asked on stackoverflow have answers that are things like:

import random random.sample(mySet, 1) 

But this is quite slow for large sets (it runs in O(n) time).

Other solutions aren't random enough (they depend on the internal representation of python sets, which produces some results which are very non-random):

for e in mySet:     break # e is now an element from mySet 

I coded my own rudimentary class which has constant time lookup, deletion, and random values.

class randomSet:     def __init__(self):         self.dict = {}         self.list = []      def add(self, item):         if item not in self.dict:             self.dict[item] = len(self.list)             self.list.append(item)      def addIterable(self, item):         for a in item:             self.add(a)      def delete(self, item):         if item in self.dict:             index = self.dict[item]             if index == len(self.list)-1:                 del self.dict[self.list[index]]                 del self.list[index]             else:                 self.list[index] = self.list.pop()                 self.dict[self.list[index]] = index                 del self.dict[item]      def getRandom(self):         if self.list:             return self.list[random.randomint(0,len(self.list)-1)]      def popRandom(self):         if self.list:             index = random.randint(0,len(self.list)-1)             if index == len(self.list)-1:                 del self.dict[self.list[index]]                 return self.list.pop()             returnValue = self.list[index]             self.list[index] = self.list.pop()             self.dict[self.list[index]] = index             del self.dict[returnValue]             return returnValue 

Are there any better implementations for this, or any big improvements to be made to this code?

like image 847
GrantS Avatar asked Sep 25 '12 16:09

GrantS


People also ask

How do you pop a random element from a set in Python?

The set pop() is a built-in Python method that removes the random element from the set and returns the removed element. The pop() method does not pop from the back or front; it can pop any element from anywhere present in the set. Unlike the stack, a random element is popped off the set.

Is set Pop () random?

Python Set | pop() Python set pop() Method removes any a random element from the set and returns the removed element.

Does set have pop Python?

The Python built-in type set has a method called pop(), from the docs: Remove and return an arbitrary element from the set.

How does pop work in set in Python?

Python Set pop() Method The pop() method removes a random item from the set. This method returns the removed item.


1 Answers

I think the best way to do this would be to use the MutableSet abstract base class in collections. Inherit from MutableSet, and then define add, discard, __len__, __iter__, and __contains__; also rewrite __init__ to optionally accept a sequence, just like the set constructor does. MutableSet provides built-in definitions of all other set methods based on those methods. That way you get the full set interface cheaply. (And if you do this, addIterable is defined for you, under the name extend.)

discard in the standard set interface appears to be what you have called delete here. So rename delete to discard. Also, instead of having a separate popRandom method, you could just define popRandom like so:

def popRandom(self):     item = self.getRandom()     self.discard(item)     return item 

That way you don't have to maintain two separate item removal methods.

Finally, in your item removal method (delete now, discard according to the standard set interface), you don't need an if statement. Instead of testing whether index == len(self.list) - 1, simply swap the final item in the list with the item at the index of the list to be popped, and make the necessary change to the reverse-indexing dictionary. Then pop the last item from the list and remove it from the dictionary. This works whether index == len(self.list) - 1 or not:

def discard(self, item):     if item in self.dict:         index = self.dict[item]         self.list[index], self.list[-1] = self.list[-1], self.list[index]         self.dict[self.list[index]] = index         del self.list[-1]                    # or in one line:         del self.dict[item]                  # del self.dict[self.list.pop()] 
like image 75
senderle Avatar answered Sep 29 '22 14:09

senderle