Pandas slicing FutureWarning with 0.21.0

Tags: python, slice, pandas, filter

I'm trying to select a subset of a DataFrame: only some of the columns, with a filter on the rows.

df.loc[df.a.isin(['Apple', 'Pear', 'Mango']), ['a', 'b', 'f', 'g']]

However, I'm getting this warning:

Passing list-likes to .loc or [] with any missing label will raise KeyError in the future, you can use .reindex() as an alternative.

What's the correct way to slice and filter now?

291

asked Dec 19 '17 22:12

QuinRiva

1 Answer

TL;DR: There is likely a typo or spelling error in the column header names.

This is a change introduced in v0.21.0, and it is explained at length in the docs:

Previously, selecting with a list of labels, where one or more labels were missing would always succeed, returning NaN for missing labels. This will now show a FutureWarning. In the future this will raise a KeyError (GH15747). This warning will trigger on a DataFrame or a Series for using .loc[] or [[]] when passing a list-of-labels with at least 1 missing label.

For example,

df
     A    B  C
0  7.0  NaN  8
1  3.0  3.0  5
2  8.0  1.0  7
3  NaN  0.0  3
4  8.0  2.0  7
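If you want to follow along, the frame above can be rebuilt roughly like this. It's a sketch read off the printed output, so the dtypes (float for `A` and `B`, int for `C`) are an assumption.

```python
import numpy as np
import pandas as pd

# Toy frame matching the printed example; values copied from the output above.
df = pd.DataFrame({
    "A": [7.0, 3.0, 8.0, np.nan, 8.0],
    "B": [np.nan, 3.0, 1.0, 0.0, 2.0],
    "C": [8, 5, 7, 3, 7],
})

# Rows where A > 6 (NaN compares as False, so row 3 is excluded).
mask = df["A"].gt(6)
print(df.loc[mask, ["A", "C"]])
```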

Try the same kind of slicing you're doing:

df.loc[df.A.gt(6), ['A', 'C']]
     A  C
0  7.0  8
2  8.0  7
4  8.0  7

No problem. Now, try replacing C with a non-existent column label:

df.loc[df.A.gt(6), ['A', 'D']]
FutureWarning: Passing list-likes to .loc or [] with any missing label will raise KeyError in the future, you can use .reindex() as an alternative.

     A   D
0  7.0 NaN
2  8.0 NaN
4  8.0 NaN

So, in your case, the warning is caused by the column labels you pass to loc: at least one of them does not exist in the DataFrame. Take another look at them.
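To find the offending label quickly, something like the following may help. It's a minimal sketch on the toy frame from this answer, not the only way to do it; `wanted`, `missing`, `present`, `subset` and `filled` are names made up for illustration. As far as I know, pandas 1.0 and later raise `KeyError` for the plain `.loc` lookup instead of warning, so neither option below passes the missing label to `.loc`.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "A": [7.0, 3.0, 8.0, np.nan, 8.0],
    "B": [np.nan, 3.0, 1.0, 0.0, 2.0],
    "C": [8, 5, 7, 3, 7],
})

wanted = ["A", "D"]  # "D" is deliberately missing from df

# Which requested labels are not columns of df?
missing = pd.Index(wanted).difference(df.columns)
print(list(missing))  # ['D']

# Option 1: select only the labels that actually exist.
present = [col for col in wanted if col in df.columns]
subset = df.loc[df["A"].gt(6), present]

# Option 2: filter the rows first, then reindex the columns, as the
# warning suggests. Missing labels come back as all-NaN columns.
filled = df.loc[df["A"].gt(6)].reindex(columns=wanted)
print(filled)
```

Option 1 is the safer choice when a missing column means a typo; option 2 is for when you really do want a NaN placeholder column.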

175

answered Oct 14 '22 13:10

cs95
