Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Correct style for line breaks when chaining methods in Python

I have some code like this. Should the break occur before the periods or after?

# before my_var = somethinglikethis.where(we=do_things).where(we=domore).where(we=everdomore)  # this way my_var = somethinglikethis.where(we=do_things) \                           .where(we=domore) \                           .where(we=everdomore)  # or this way my_var = somethinglikethis.where(we=do_things). \                            where(we=domore). \                            where(we=everdomore) 
like image 586
JiminyCricket Avatar asked Oct 30 '11 00:10

JiminyCricket


People also ask

How do you format a line break in Python?

In Python, the new line character “\n” is used to create a new line. When inserted in a string all the characters after the character are added to a new line. Essentially the occurrence of the “\n” indicates that the line ends here and the remaining characters would be displayed in a new line.

What is chaining in pandas?

Pandas chaining is an alternative to variable assignment when transforming data. Those in favor of chaining argue that the code is easier to read because it lays out the execution of the transformation like a recipe.


2 Answers

PEP 8 recommends using parenthesis so that you don't need \, and gently suggests breaking before binary operators instead of after them. Thus, the preferred way of formatting you code is like this:

my_var = (somethinglikethis           .where(we=do_things)           .where(we=domore)           .where(we=everdomore)) 

The two relevant passages are this one from the Maximum Line Length section:

The preferred way of wrapping long lines is by using Python's implied line continuation inside parentheses, brackets and braces. Long lines can be broken over multiple lines by wrapping expressions in parentheses. These should be used in preference to using a backslash for line continuation.

... and the entire Should a line break before or after a binary operator? section:

Should a line break before or after a binary operator?

For decades the recommended style was to break after binary operators. But this can hurt readability in two ways: the operators tend to get scattered across different columns on the screen, and each operator is moved away from its operand and onto the previous line. Here, the eye has to do extra work to tell which items are added and which are subtracted:

# No: operators sit far away from their operands income = (gross_wages +           taxable_interest +           (dividends - qualified_dividends) -           ira_deduction -           student_loan_interest) 

To solve this readability problem, mathematicians and their publishers follow the opposite convention. Donald Knuth explains the traditional rule in his Computers and Typesetting series: "Although formulas within a paragraph always break after binary operations and relations, displayed formulas always break before binary operations"

Following the tradition from mathematics usually results in more readable code:

# Yes: easy to match operators with operands income = (gross_wages           + taxable_interest           + (dividends - qualified_dividends)           - ira_deduction           - student_loan_interest) 

In Python code, it is permissible to break before or after a binary operator, as long as the convention is consistent locally. For new code Knuth's style is suggested.

Note that, as indicated in the quote above, PEP 8 used to give the opposite advice about where to break around an operator, quoted below for posterity:

The preferred way of wrapping long lines is by using Python's implied line continuation inside parentheses, brackets and braces. Long lines can be broken over multiple lines by wrapping expressions in parentheses. These should be used in preference to using a backslash for line continuation. Make sure to indent the continued line appropriately. The preferred place to break around a binary operator is after the operator, not before it. Some examples:

class Rectangle(Blob):      def __init__(self, width, height,                  color='black', emphasis=None, highlight=0):         if (width == 0 and height == 0 and             color == 'red' and emphasis == 'strong' or             highlight > 100):             raise ValueError("sorry, you lose")         if width == 0 and height == 0 and (color == 'red' or                                            emphasis is None):             raise ValueError("I don't think so -- values are %s, %s" %                              (width, height))         Blob.__init__(self, width, height,                       color, emphasis, highlight) 
like image 84
Bastien Léonard Avatar answered Oct 07 '22 00:10

Bastien Léonard


PEP 8 says that breaking before the operator is preferred:

Donald Knuth explains the traditional rule in his Computers and Typesetting series: "Although formulas within a paragraph always break after binary operations and relations, displayed formulas always break before binary operations".

...

In Python code, it is permissible to break before or after a binary operator, as long as the convention is consistent locally. For new code Knuth's style is suggested.

https://www.python.org/dev/peps/pep-0008/#should-a-line-break-before-or-after-a-binary-operator

like image 21
Neapolitan Avatar answered Oct 06 '22 23:10

Neapolitan