I'm supposed to take a list of words and count all words in it which are 2 or more characters long and where the first and last character are equal. I came up with two possible solutions: <pre class="prettyprint"><code>result = 0 for word in words: if len(word) >= 2 and word[0] == word[-1]: result += 1 return result </code></pre> vs. <pre class="prettyprint"><code>return len([word for word in words if len(word) >= 2 and word[0] == word[-1]]) </code></pre> Which one would be the preferred solution? Or are there even better ones?

In your second example a generator expression would be better than list-comp if your list is large. <pre class="prettyprint"><code>sum(1 for word in words if len(word) >= 2 and word[0] == word[-1]) </code></pre>

The first one would definitely be the preferred solution in Python. Don't forget your Zen of Python: <blockquote> The Zen of Python, by Tim Peters Beautiful is better than ugly. Explicit is better than implicit. Simple is better than complex. Complex is better than complicated. Flat is better than nested. Sparse is better than dense. Readability counts. Special cases aren't special enough to break the rules. Although practicality beats purity. Errors should never pass silently. Unless explicitly silenced. In the face of ambiguity, refuse the temptation to guess. There should be one-- and preferably only one --obvious way to do it. Although that way may not be obvious at first unless you're Dutch. Now is better than never. Although never is often better than right now. If the implementation is hard to explain, it's a bad idea. If the implementation is easy to explain, it may be a good idea. Namespaces are one honking great idea -- let's do more of those! </blockquote> Other than that your solutions are good.

Some other variants you might want to consider: First, you can break the filter condition into a function. This condition is fine either way, but if it becomes any more complex I'd definitely do this: <pre class="prettyprint"><code>def check(word): return len(word) >= 2 and word[0] == word[-1] sum(1 for word in words if check(word)) </code></pre> Next, if generating a list (as in the original list comprehension) is acceptable, then you can do this: <pre class="prettyprint"><code>len(filter(check, words)) </code></pre> There's itertools.ifilter, but if you use that you need to use the <code>sum</code> expression again, so it doesn't end up any clearer. The <code>sum</code> trick comes up so often that I'm surprised there isn't a standard library call to count the number of items in an iterator (if there is, I havn't found it). Alternatively, it'd make sense if <code>len</code> would consume and count the number of entries in an iterator if it has no <code>__len__</code>, but it doesn't.

List comprehension and len() vs. simple for loop

Tags:

python

list-comprehension

I'm supposed to take a list of words and count all words in it which are 2 or more characters long and where the first and last character are equal.

I came up with two possible solutions:

result = 0
for word in words:
    if len(word) >= 2 and word[0] == word[-1]:
        result += 1
return result

vs.

return len([word for word in words if len(word) >= 2 and word[0] == word[-1]])

Which one would be the preferred solution? Or are there even better ones?

672

asked Nov 02 '10 23:11

helpermethod

5 Answers

In your second example a generator expression would be better than list-comp if your list is large.

sum(1 for word in words if len(word) >= 2 and word[0] == word[-1])

158

answered Oct 04 '22 18:10

mechanical_meat

The first one would definitely be the preferred solution in Python.

Don't forget your Zen of Python:

The Zen of Python, by Tim Peters

Beautiful is better than ugly.

Explicit is better than implicit.

Simple is better than complex.

Complex is better than complicated.

Flat is better than nested.

Sparse is better than dense.

Readability counts.

Special cases aren't special enough to break the rules.

Although practicality beats purity.

Errors should never pass silently.

Unless explicitly silenced.

In the face of ambiguity, refuse the temptation to guess.

There should be one-- and preferably only one --obvious way to do it.

Although that way may not be obvious at first unless you're Dutch.

Now is better than never.

Although never is often better than right now.

If the implementation is hard to explain, it's a bad idea.

If the implementation is easy to explain, it may be a good idea.

Namespaces are one honking great idea -- let's do more of those!

Other than that your solutions are good.

answered Oct 04 '22 16:10

jacrough

I personally find the explicit loop more readable, but it's much a matter of taste (some prefer shorter code generally, especially when they have to write it).

Either version can be further shortened/improved:

result = 0
for word in words:
    result += int(len(word) >= 2 and word[0] == word[-1])
return result

The int() conversions is strictly speaking unnecessary, since True is a kind of 1, but it may be better for readability. The same approach can apply to the comprehension:

return sum(len(word) >= 2 and word[0] == word[-1] for word in words)

If you want to use len(), I'd point the reader to the fact that the values don't really matter:

len(1 for word in words if len(word) >= 2 and word[0] == word[-1])

answered Oct 04 '22 17:10

Martin v. Löwis

Both are pretty good.

There are small differences:

List comprehension returns another list which you are passing to len. The first solution avoids creation of another list.

answered Oct 04 '22 18:10

pyfunc

Some other variants you might want to consider:

First, you can break the filter condition into a function. This condition is fine either way, but if it becomes any more complex I'd definitely do this:

def check(word):
    return len(word) >= 2 and word[0] == word[-1]
sum(1 for word in words if check(word))

Next, if generating a list (as in the original list comprehension) is acceptable, then you can do this:

len(filter(check, words))

There's itertools.ifilter, but if you use that you need to use the sum expression again, so it doesn't end up any clearer.

The sum trick comes up so often that I'm surprised there isn't a standard library call to count the number of items in an iterator (if there is, I havn't found it). Alternatively, it'd make sense if len would consume and count the number of entries in an iterator if it has no __len__, but it doesn't.

answered Oct 04 '22 16:10

Glenn Maynard

Related questions
                            
                                PyCharm can't find Spacy Model 'en'
                            
                                Unused variable in a for loop
                            
                                Map an image onto a sphere and plot 3D trajectories
                            
                                Applying Regex across entire column of a Dataframe
                            
                                Anaconda-Jupyter Doesn't open in browser
                            
                                Zipping two arrays of n and 2n length to form a dictionary
                            
                                pip - No module named 'pip' even after successful installation
                            
                                name 'IntegrityError' is not defined - why can't I used 'except: IntegrityError' on this script?
                            
                                Running Python Code in .NET Environment without Installing Python
                            
                                Reshape long to wide using columns names
                            
                                How do I convert a 3D point cloud (.ply) into a mesh (with faces and vertices)?
                            
                                Python Ctypes - loading dll throws OSError: [WinError 193] %1 is not a valid Win32 application
                            
                                How to ignore certain Python errors from Sentry capture
                            
                                Building Wheel For Pycares (Setup.Py) Error
                            
                                Is this idiom pythonic? (someBool and "True Result" or "False Result")
                            
                                What is the difference between these two solutions - lambda or loop - Python
                            
                                Git library for Ruby or Python?
                            
                                Which is most accurate way to distinguish one of 8 colors?
                            
                                How to check whether elements appears in the list only once in python?
                            
                                python and sqlite - escape input

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With