With <code>x = [1,2,3,4]</code>, I can get an iterator from <code>i = iter(x)</code>. With this iterator, I can use zip function to create a tuple with two items. <pre class="prettyprint"><code>>>> i = iter(x) >>> zip(i,i) [(1, 2), (3, 4)] </code></pre> Even I can use this syntax to get the same results. <pre class="prettyprint"><code>>>> zip(*[i] * 2) [(1, 2), (3, 4)] </code></pre> How does this work? How an iterator with <code>zip(i,i)</code> and <code>zip(*[i] * 2)</code> work?

An iterator is like a stream of items. You can only look at the items in the stream one at a time and you only ever have access to the first element. To look at something in the stream, you need to remove it from the stream and once you take something from the top of the stream, it's gone from the stream for good. When you call <code>zip(i, i)</code>, <code>zip</code> first looks at the first stream and takes an item out. Then it looks at the second stream (which happens to be the same stream as the first one) and takes an item out. Then it makes a tuple out of those two items and repeats this over and over until there is nothing left in the stream. Maybe it's easier to see if I were to write the <code>zip</code> function in pure python (with only 2 arguments for simplicity). It would look something like1: <pre class="prettyprint"><code>def zip(a, b): out = [] try: while True: item1 = next(a) item2 = next(b) out.append((item1, item2)) except StopIteration: return out </code></pre> Now imagine the case that you are talking about where <code>a</code> and <code>b</code> are the same object. In that case, we just call <code>next</code> twice on the iterator (<code>i</code> in your example case) which will just take the first two items from <code>i</code> in sequence and pack them into a tuple. Once we've understood why <code>zip(i, i)</code> behaves the way it does, <code>zip(*([i] * 2))</code> isn't too hard. Lets read the expression from the inside out... <pre class="prettyprint"><code>[i] * 2 </code></pre> That just creates a new list (of length 2) where both of the elements are references to the iterator <code>i</code>. So it's the same thing as <code>zip(*[i, i])</code> (it's just more convenient to write when you want to repeat something many more than 2 times). <code>*</code> unpacking is a common idiom in python and you can find more information in the python tutorial. The gist of it is that python takes the iterable and "unpacks" it as if each item of the iterable was a separate positional argument to the function. So: <pre class="prettyprint"><code>zip(*[i, i]) </code></pre> does the same thing as: <pre class="prettyprint"><code>zip(i, i) </code></pre> And now Bob's our uncle. We've just come full-circle since <code>zip(i, i)</code> is where this discussion started. 1This example code is definitely simplified more than just the afore-mentioned only accepting 2 arguments. For example, <code>zip</code> is probably going to call <code>iter</code> on the input arguments so that it works for any iterable (not just iterators), but this should be enough to get the point across...

Python iterator and zip

Tags:

python

iterator

zip

With x = [1,2,3,4], I can get an iterator from i = iter(x).

With this iterator, I can use zip function to create a tuple with two items.

>>> i = iter(x)
>>> zip(i,i)
[(1, 2), (3, 4)]

Even I can use this syntax to get the same results.

>>> zip(*[i] * 2)
[(1, 2), (3, 4)]

How does this work? How an iterator with zip(i,i) and zip(*[i] * 2) work?

819

asked Jun 25 '16 03:06

prosseek

1 Answers

An iterator is like a stream of items. You can only look at the items in the stream one at a time and you only ever have access to the first element. To look at something in the stream, you need to remove it from the stream and once you take something from the top of the stream, it's gone from the stream for good.

When you call zip(i, i), zip first looks at the first stream and takes an item out. Then it looks at the second stream (which happens to be the same stream as the first one) and takes an item out. Then it makes a tuple out of those two items and repeats this over and over until there is nothing left in the stream.

Maybe it's easier to see if I were to write the zip function in pure python (with only 2 arguments for simplicity). It would look something like¹:

def zip(a, b):
    out = []
    try:
        while True:
            item1 = next(a)
            item2 = next(b)
            out.append((item1, item2))
    except StopIteration:
        return out

Now imagine the case that you are talking about where a and b are the same object. In that case, we just call next twice on the iterator (i in your example case) which will just take the first two items from i in sequence and pack them into a tuple.

Once we've understood why zip(i, i) behaves the way it does, zip(*([i] * 2)) isn't too hard. Lets read the expression from the inside out...

[i] * 2

That just creates a new list (of length 2) where both of the elements are references to the iterator i. So it's the same thing as zip(*[i, i]) (it's just more convenient to write when you want to repeat something many more than 2 times). * unpacking is a common idiom in python and you can find more information in the python tutorial. The gist of it is that python takes the iterable and "unpacks" it as if each item of the iterable was a separate positional argument to the function. So:

zip(*[i, i])

does the same thing as:

zip(i, i)

And now Bob's our uncle. We've just come full-circle since zip(i, i) is where this discussion started.

^{¹This example code is definitely simplified more than just the afore-mentioned only accepting 2 arguments. For example, zip is probably going to call iter on the input arguments so that it works for any iterable (not just iterators), but this should be enough to get the point across...}

104

answered Oct 23 '22 15:10

mgilson

Related questions
                            
                                How to XOR two strings in Python
                            
                                How do Dask dataframes handle larger-than-memory datasets?
                            
                                Using bounding rectangle to get rotation angle not working (OpenCV/Python)
                            
                                Forward fill all except last value in python pandas dataframe
                            
                                Checking if text file is empty Python [duplicate]
                            
                                Django serializers: validate function not called
                            
                                Is it possible to run a command that is in a list?
                            
                                Get index values from slice objects in python [duplicate]
                            
                                Use of StreamField in Snippets on Wagtail
                            
                                Django model DateTimeField set auto_now_add format or modify the serializer
                            
                                Difficulty comparing generated and google cloud storage provided CRC32c checksums
                            
                                How to send axhline to back of Matplotlib's barplot
                            
                                Lazy evaluation of map
                            
                                Convert RGB triplets to LAB triplets using skimage.color.rgb2lab()
                            
                                Pandas Read_CSV quotes issue
                            
                                accessing all non zero entries of a csr_matrix
                            
                                How to add counter column in django-tables2?
                            
                                Python: __file__ of the caller
                            
                                Can't import nltk module in Juypter notebook?
                            
                                What's wrong with my GMRES implementation?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With