itertools.product eliminating repeated elements

Tags:

python

python-2.7

itertools

How can I skip the tuples that contain duplicate elements when I iterate over itertools.product? Or, better, is there a way to avoid generating them at all? Skipping them afterwards may be time-consuming if there are many lists.

Example,
lis1 = [1,2]
lis2 = [2,4]
lis3 = [5,6]

[i for i in product(lis1,lis2,lis3)] should be [(1,2,5), (1,2,6), (1,4,5), (1,4,6), (2,4,5), (2,4,6)]

The result should not include (2,2,5) and (2,2,6), since 2 appears twice in each. How can I do that?

759

asked Nov 02 '13 17:11

genclik27

2 Answers

itertools generally works on unique positions within inputs, not on unique values. So when you want to remove duplicate values, you generally have to either post-process the itertools result sequence, or "roll your own". Because post-processing can be very inefficient in this case, roll your own:

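def uprod(*seqs):
    # Recursively fill position i, trying only values not already used.
    def inner(i):
        if i == n:
            # Every position is filled: emit the finished tuple.
            yield tuple(result)
            return
        for elt in sets[i] - seen:
            seen.add(elt)
            result[i] = elt
            for t in inner(i+1):
                yield t
            # Backtrack so elt can be used again in other branches.
            seen.remove(elt)

    sets = [set(seq) for seq in seqs]  # drops duplicates within each input
    n = len(sets)
    seen = set()          # values used in the current partial tuple
    result = [None] * n   # the partial tuple being built
    for t in inner(0):
        yield t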

Then, e.g.,

>>> print list(uprod([1, 2, 1], [2, 4, 4], [5, 6, 5]))
[(1, 2, 5), (1, 2, 6), (1, 4, 5), (1, 4, 6), (2, 4, 5), (2, 4, 6)]
>>> print list(uprod([1], [1, 2], [1, 2, 4], [1, 5, 6]))
[(1, 2, 4, 5), (1, 2, 4, 6)]
>>> print list(uprod([1], [1, 2, 4], [1, 5, 6], [1]))
[]
>>> print list(uprod([1, 2], [3, 4]))
[(1, 3), (1, 4), (2, 3), (2, 4)]

This can be much more efficient, since a duplicate value is never even considered (neither within an input iterable, nor across them).
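
For readers on Python 3, the same backtracking idea might be sketched as follows. The name unique_product is hypothetical, and keeping each input's first-seen order (instead of relying on set iteration order) is an assumption added here, not something the code above does. As with uprod, all elements must be hashable.

```python
def unique_product(*seqs):
    """Yield product tuples whose elements are all distinct (a sketch)."""
    # Drop repeated values within each input while keeping first-seen
    # order; dict keys preserve insertion order in Python 3.7+.
    pools = [list(dict.fromkeys(seq)) for seq in seqs]
    n = len(pools)
    seen = set()          # values used in the current partial tuple
    result = [None] * n   # the partial tuple being built

    def inner(i):
        if i == n:
            yield tuple(result)
            return
        for elt in pools[i]:
            if elt in seen:
                continue  # prune: never extend a prefix with a repeated value
            seen.add(elt)
            result[i] = elt
            yield from inner(i + 1)
            seen.remove(elt)  # backtrack

    return inner(0)


print(list(unique_product([1, 2], [2, 4], [5, 6])))
```

Because a repeated value is rejected as soon as it would enter the partial tuple, the whole subtree below it is skipped, which is where the saving over post-filtering comes from.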

150

answered Sep 28 '22 07:09

Tim Peters

from itertools import product

lis1 = [1,2]
lis2 = [2,4]
lis3 = [5,6]
print [i for i in product(lis1,lis2,lis3) if len(set(i)) == 3]

Output

[(1, 2, 5), (1, 2, 6), (1, 4, 5), (1, 4, 6), (2, 4, 5), (2, 4, 6)]
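
The hard-coded 3 ties this filter to exactly three lists. A minimal sketch that works for any number of lists, written for Python 3, could compare against the tuple's own length instead. The helper name distinct_tuples is hypothetical.

```python
from itertools import product


def distinct_tuples(*seqs):
    """Yield tuples from product(*seqs) that contain no repeated value."""
    for t in product(*seqs):
        # A tuple has a repeated value exactly when its set is shorter.
        if len(set(t)) == len(t):
            yield t


print(list(distinct_tuples([1, 2], [2, 4], [5, 6])))
```

This still generates every tuple of the full product before discarding the ones with repeats, so it does not address the cost concern raised in the question when the lists are many or long.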

answered Sep 28 '22 08:09

thefourtheye
