Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Create iterator to return elements from each iterable one by one

I was learning the itertools module and I am trying to make an iterator to return each element from the iterables provided as input.

Agruments   Results
p, q, …     p0, q0, … plast, qlast 

with one more rider that if say the lists are not of the same length then next(it) should return elements from the longer list when the shorter one runs out.

Attempt at solution

import itertools
l1=[1,2,3,4,5,6]
l2=['a','b','c','d']
l=[]
for x,y in itertools.zip_longest(l1,l2):
    l.extend([x,y])
it=iter(x for x in l if x is not None)

Which kind of solves my problem

print(list(it))

Outputs:

[1, 'a', 2, 'b', 3, 'c', 4, 'd', 5, 6]

Is there an easier or better way to do this? I searched for a solution on SO and was not able to get one.

like image 741
Bitto Bennichan Avatar asked Jan 29 '19 21:01

Bitto Bennichan


People also ask

How do I return an iterator in Python?

Iterator objects in python conform to the iterator protocol, which basically means they provide two methods: __iter__() and __next__() . The __iter__ returns the iterator object and is implicitly called at the start of loops. The __next__() method returns the next value and is implicitly called at each loop increment.

What does __ ITER __ return?

The __iter__() function returns an iterator for the given object (array, set, tuple, etc. or custom objects). It creates an object that can be accessed one element at a time using __next__() function, which generally comes in handy when dealing with loops.

Which function return an iterable with a next () method that yields the next element in a sequence?

The __iter__() method returns the iterator object itself. If required, some initialization can be performed. The __next__() method must return the next item in the sequence. On reaching the end, and in subsequent calls, it must raise StopIteration .

Which method returns the iterator object itself?

The __iter__() method acts similar, you can do operations (initializing etc.), but must always return the iterator object itself. The __next__() method also allows you to do operations, and must return the next item in the sequence.


2 Answers

You can use itertools.chain.from_iterable() to flatten the sequence, and use a generator expression to filter out the None values:

from itertools import chain, zip_longest

it = (v for v in chain.from_iterable(zip_longest(l1, l2)) if v is not None)

Rather than use None as the sentinel value, you may want to use a dedicated sentinel so you can use None in the input list:

_sentinel = object()
flattened = chain.from_iterable(zip_longest(l1, l2, fillvalue=_sentinel))
it = (v for v in flattened if v is not _sentinel)

If you want to filter out falsey values, then you can also use filter(None, ...):

it = filter(None, chain.from_iterable(zip_longest(l1, l2)))

Demo:

>>> from itertools import chain, zip_longest
>>> l1 = [1, 2, 3, 4, 5, 6]
>>> l2 = ['a', 'b', 'c', 'd']
>>> it = (v for v in chain.from_iterable(zip_longest(l1, l2)) if v is not None)
>>> list(it)
[1, 'a', 2, 'b', 3, 'c', 4, 'd', 5, 6]

and with a local sentinel:

>>> l1 = [1, None, 2, None, 3, None]
>>> l2 = ['a', 'b', 'c', 'd']
>>> _sentinel = object()
>>> flattened = chain.from_iterable(zip_longest(l1, l2, fillvalue=_sentinel))
>>> it = (v for v in flattened if v is not _sentinel)
>>> list(it)
[1, 'a', None, 'b', 2, 'c', None, 'd', 3, None]

The itertools recipes section also has:

def roundrobin(*iterables):
    "roundrobin('ABC', 'D', 'EF') --> A D E B F C"
    # Recipe credited to George Sakkis
    num_active = len(iterables)
    nexts = cycle(iter(it).__next__ for it in iterables)
    while num_active:
        try:
            for next in nexts:
                yield next()
        except StopIteration:
            # Remove the iterator we just exhausted from the cycle.
            num_active -= 1
            nexts = cycle(islice(nexts, num_active))
like image 191
Martijn Pieters Avatar answered Oct 08 '22 03:10

Martijn Pieters


If you want a modified version of your code, building a generator from the start (no storing list l):

import itertools
l1=[1,2,3,4,5,6]
l2=['a','b','c','d']

def flat_zip(l1,l2):
    for x,y in itertools.zip_longest(l1,l2):
        if x:
            yield x
        if y:
            yield y
it=flat_zip(l1,l2)

Though I advise for using the builtin solutions above.

like image 39
Learning is a mess Avatar answered Oct 08 '22 05:10

Learning is a mess