I have a <code>DataFrame</code>. 1 column (<code>name</code>) has string values. I was wondering if there was a way to select rows based on a partial string match against a particular column, using the <code>DataFrame.query()</code> method. I tried: <ul> <li> <code>df.query('name.str.contains("lu")')</code>. Error message: "TypeError: 'Series' objects are mutable, thus they cannot be hashed"</li> <li> <code>df.query('"lu" in name')</code>. Returns an empty <code>DataFrame</code>.</li> </ul> The code I use: <pre class="prettyprint"><code>import pandas as pd df = pd.DataFrame({ 'name':['blue','red','blue'], 'X1':[96.32,96.01,96.05] }, columns=['name','X1']) print(df.query('"lu" in name').head()) print(df.query('name.str.contains("lu")').head()) </code></pre> I know I could use <code>df[df['name'].str.contains("lu")]</code> but I prefer to use query.

The issue that @ayhan refers to now shows how this can be achieved by using <code>query</code>'s python engine: <code>print(df.query('name.str.contains("lu")', engine='python').head())</code> should work.

Select rows by partial string with query with pandas

Tags:

python

pandas

dataframe

I have a DataFrame. 1 column (name) has string values. I was wondering if there was a way to select rows based on a partial string match against a particular column, using the DataFrame.query() method.

I tried:

df.query('name.str.contains("lu")'). Error message: "TypeError: 'Series' objects are mutable, thus they cannot be hashed"
df.query('"lu" in name'). Returns an empty DataFrame.

The code I use:

import pandas as pd

df = pd.DataFrame({
    'name':['blue','red','blue'],
    'X1':[96.32,96.01,96.05]
}, columns=['name','X1'])  


print(df.query('"lu" in name').head())
print(df.query('name.str.contains("lu")').head())

I know I could use df[df['name'].str.contains("lu")] but I prefer to use query.

818

asked Jul 05 '17 18:07

Franck Dernoncourt

1 Answers

The issue that @ayhan refers to now shows how this can be achieved by using query's python engine:

print(df.query('name.str.contains("lu")', engine='python').head())

should work.

108

answered Oct 25 '22 01:10

petobens

Related questions
                            
                                What is the use of returning self in the __iter__ method?
                            
                                Python Tkinter Entry get()
                            
                                python abstract base classes, difference between mixin & abstract method
                            
                                Call column in dataframe by column index instead of column name - pandas
                            
                                Python: How can I tell if my python has SSL?
                            
                                Difference between reverse and [::-1]
                            
                                python decorate function call
                            
                                Zen of Python: Errors should never pass silently. Why does zip work the way it does?
                            
                                Represent infinity as an integer in Python 2.7
                            
                                Sort a sublist of elements in a list leaving the rest in place
                            
                                Why is print("text" + str(var1) + "more text" + str(var2)) described as "disapproved"?
                            
                                Sort a list of tuples in consecutive order
                            
                                Multiple 'for' loops in dictionary generator
                            
                                Rotate minor ticks in matplotlib
                            
                                Python: How to NOT wait for a thread to finish to carry on?
                            
                                Can't run binary from within python aws lambda function
                            
                                Is it possible to show multiple plots in separate windows using matplotlib?
                            
                                Replace multiple newlines with single newlines during reading file
                            
                                Matplotlib - Changing the color of a single x-axis tick label
                            
                                A Better Way to Calculate Odd Ratio in Pandas

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With