Why does a class need to define <code>__iter__()</code> returning self, to get an iterator of the class? <pre class="prettyprint"><code>class MyClass: def __init__(self): self.state = 0 def __next__(self): self.state += 1 if self.state > 4: raise StopIteration return self.state myObj = MyClass() for i in myObj: print(i) </code></pre> Console log: <pre class="prettyprint"><code>Traceback (most recent call last): for i in myObj: TypeError: 'MyClass' object is not iterable </code></pre> the answer https://stackoverflow.com/a/9884259/4515198, says <blockquote> An iterator is an object with a next (Python 2) or <code>__next__</code> (Python 3) method. </blockquote> The task of adding the following: <pre class="prettyprint"><code>def __iter__(self): return self </code></pre> is to return an iterator, or an object of the class, which defines the <code>__next__()</code> method. But, isn't the task of returning an object of MyClass (which defines the <code>__next__()</code> method) already done by the <code>__new__()</code> method, when MyClass is instantiated in the line myObj = MyClass() ? Won't the objects of a class defining <code>__next__()</code> method, be iterators by themselves? I have studied the questions What is the use of returning self in the __iter__ method? and Build a Basic Python Iterator, but I am still unable to understand the reason for having an <code>__iter__()</code> method returning self.

The answer to the question of why the __iter__() method is necessary is that for for-loops always start by calling iter() on an object to get an iterator. That is why even iterators themselved need an __iter__() method to work with for-loops. After for calls iter(), then it calls __next__() on the resulting iterator to obtain a value. The rules for creating iterables and iterators are: 1) Iterables have an __iter__() method that returns an iterator. 2) Iterators have a __next__() method that returns a value, that updates the state, and that raises StopIteration when complete. 3) Iterators themselves have a __iter__() method that returns self. That means that all iterators are self-iterable. The benefit of the last rule for iterators having an __iter__() method that returns self is that it allows us to pass around partially consumed iterators: <pre class="prettyprint"><code>>>> s = 'hello world' >>> it = iter(s) >>> next(it) 'h' >>> next(it) 'e' >>> list(it) # Doesn't start from the beginning ['l', 'l', 'o', ' ', 'w', 'o', 'r', 'l', 'd'] </code></pre> Here's another example that depends on iterators being self-iterable without restarting: <pre class="prettyprint"><code>>>> s = 'hello world' >>> it = iter(s) >>> list(zip(it, it)) [('h', 'e'), ('l', 'l'), ('o', ' '), ('w', 'o'), ('r', 'l')] </code></pre> Notes: 1) An alternative way to make an iterable is to supply a __getitem__() method that accepts consecutive indices and raises IndexError when complete. This is how str objects iterated in Python 2. 2) Some objects like files are their own iterator. That means that you can call next() directly on a file object. It also means that files cannot have multiple, independent iterators (the file object itself has the state tracking the position within the file). 3) The iterator design pattern described above isn't Python specific. It is a general purpose design pattern for many OOP languages: https://en.wikipedia.org/wiki/Iterator_pattern

Why does a class need iter() to return an iterator?

Tags:

python

iterator

oop

python-3.x

class

Why does a class need to define __iter__() returning self, to get an iterator of the class?

class MyClass:
    def __init__(self):
        self.state = 0

    def __next__(self):
        self.state += 1
        if self.state > 4:
            raise StopIteration
        return self.state

myObj = MyClass()
for i in myObj:
    print(i)

Console log:

Traceback (most recent call last):
   for i in myObj:
TypeError: 'MyClass' object is not iterable

the answer https://stackoverflow.com/a/9884259/4515198, says

An iterator is an object with a next (Python 2) or __next__ (Python 3) method.

The task of adding the following:

def __iter__(self):
   return self

is to return an iterator, or an object of the class, which defines the __next__() method.

But, isn't the task of returning an object of MyClass (which defines the __next__() method) already done by the __new__() method, when MyClass is instantiated in the line myObj = MyClass() ?

Won't the objects of a class defining __next__() method, be iterators by themselves?

I have studied the questions What is the use of returning self in the __iter__ method? and Build a Basic Python Iterator, but I am still unable to understand the reason for having an __iter__() method returning self.

906

asked Nov 05 '16 15:11

satvik.t

1 Answers

The answer to the question of why the __iter__() method is necessary is that for for-loops always start by calling iter() on an object to get an iterator. That is why even iterators themselved need an __iter__() method to work with for-loops. After for calls iter(), then it calls __next__() on the resulting iterator to obtain a value.

The rules for creating iterables and iterators are:

1) Iterables have an __iter__() method that returns an iterator.

2) Iterators have a __next__() method that returns a value, that updates the state, and that raises StopIteration when complete.

3) Iterators themselves have a __iter__() method that returns self. That means that all iterators are self-iterable.

The benefit of the last rule for iterators having an __iter__() method that returns self is that it allows us to pass around partially consumed iterators:

>>> s = 'hello world'
>>> it = iter(s)
>>> next(it)
'h'
>>> next(it)
'e'
>>> list(it)     # Doesn't start from the beginning
['l', 'l', 'o', ' ', 'w', 'o', 'r', 'l', 'd']

Here's another example that depends on iterators being self-iterable without restarting:

>>> s = 'hello world'
>>> it = iter(s)
>>> list(zip(it, it))
[('h', 'e'), ('l', 'l'), ('o', ' '), ('w', 'o'), ('r', 'l')]

Notes:

1) An alternative way to make an iterable is to supply a __getitem__() method that accepts consecutive indices and raises IndexError when complete. This is how str objects iterated in Python 2.

2) Some objects like files are their own iterator. That means that you can call next() directly on a file object. It also means that files cannot have multiple, independent iterators (the file object itself has the state tracking the position within the file).

3) The iterator design pattern described above isn't Python specific. It is a general purpose design pattern for many OOP languages: https://en.wikipedia.org/wiki/Iterator_pattern

answered Oct 11 '22 23:10

Raymond Hettinger

Related questions
                            
                                How to use ModelMultipleChoiceFilter?
                            
                                Splitting one NumPy array into two arrays
                            
                                How to run raw mongodb commands from pymongo
                            
                                Pass parameter with Python Flask in external Javascript
                            
                                Import caffe error
                            
                                Pandas read_table use first column as index
                            
                                DynamoDBNumberError on trying to insert floating point number using python boto library
                            
                                Group and average NumPy matrix
                            
                                Memory efficient way to split large numpy array into train and test
                            
                                non-blocking lock with 'with' statement
                            
                                How to detect if a point is contained within a bounding rect - opecv & python
                            
                                Luigi Pipeline beginning in S3
                            
                                Callbacks with ctypes (How to call a python function from C)
                            
                                Problems implementing an XOR gate with Neural Nets in Tensorflow
                            
                                Interpolating a closed curve using scipy
                            
                                How do I order fields of my Row objects in Spark (Python)
                            
                                How can I send an email using python logging's SMTPHandler and SSL
                            
                                Doing pairwise distance computation with TensorFlow
                            
                                How to fillna() with value 0 after calling resample?
                            
                                Spyder / iPython inline plot figure size

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why does a class need iter() to return an iterator?

Tags:

python

iterator

oop

python-3.x

class

satvik.t

People also ask

1 Answers

Raymond Hettinger

Recent Activity

Donate For Us

Why does a class need __iter__() to return an iterator?

Tags:

python

iterator

oop

python-3.x

class

satvik.t

People also ask

1 Answers

Raymond Hettinger

Related questions

Recent Activity

Donate For Us

Why does a class need iter() to return an iterator?