Why does pandas use (&, |) instead of the normal, pythonic (and, or)?

Tags: python, pandas
I understand that the pandas docs describe this as the convention, but I was wondering why.

For example:

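import numpy as np
import pandas as pd

df = pd.DataFrame(np.random.randn(6, 4), index=list('abcdef'), columns=list('ABCD'))
# Element-wise OR of the two boolean masks: works
print(df[(df.A < .5) | (df.B > .5)])
# Python's `or` on the same masks: raises ValueError
print(df[(df.A < .5) or (df.B > .5)])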

Returns the following:

          A         B         C         D
a -0.284669 -0.277413  2.524311 -1.386008
b -0.190307  0.325620 -0.367727  0.347600
c -0.763290 -0.108921 -0.467167  1.387327
d -0.241815 -0.869941 -0.756848 -0.335477
e -1.583691 -0.236361 -1.007421  0.298171
f -3.173293  0.521770 -0.326190  1.604712
Traceback (most recent call last):
  File "C:\test.py", line 64, in <module>
    print(df[(df.A < .5) or (df.B > .5)])   
ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()

asked Dec 19 '13 20:12

Ben Southgate

1 Answer

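Because & and | are overridable (customizable). Any class can define the special methods __and__ and __or__ to control what these operators do, which is how pandas makes them work element-wise on a Series or DataFrame.

The logical operators and and or, on the other hand, have standard behavior that cannot be modified. They evaluate the truth value of their left operand and short-circuit, returning one of the two operands. A multi-element array has no single truth value, which is why the second print in the question raises the ValueError.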

See here for the relevant documentation.
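
The difference can be sketched with a small toy class. This is only an illustration: Mask is a hypothetical stand-in for a boolean Series and is not how pandas itself is implemented. It shows that & and | dispatch to methods the class defines, while or first asks the object for its truth value.

```python
class Mask:
    """Hypothetical stand-in for a boolean Series (not pandas code)."""

    def __init__(self, values):
        self.values = list(values)

    def __and__(self, other):
        # Called for `self & other`: combine element by element.
        return Mask(x and y for x, y in zip(self.values, other.values))

    def __or__(self, other):
        # Called for `self | other`: combine element by element.
        return Mask(x or y for x, y in zip(self.values, other.values))

    def __bool__(self):
        # `and` / `or` call bool() on the left operand. A multi-element
        # mask has no single sensible truth value, so refuse.
        raise ValueError("truth value of a multi-element Mask is ambiguous")

    def __repr__(self):
        return f"Mask({self.values})"


a = Mask([True, False, True])
b = Mask([False, False, True])

print(a | b)  # Mask([True, False, True])
print(a & b)  # Mask([False, False, True])

try:
    a or b  # Python evaluates bool(a) first, which raises
except ValueError as exc:
    print("or failed:", exc)
```

There is no special method a class could define to make a or b return an element-wise result. The only hook is __bool__, and it must produce a single True or False.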

answered Oct 25 '22 11:10

slezica
