How do I subset a pandas data frame based on a list of string values?

Tags:

python

pandas

I've got a DataFrame that's over 100k rows long and a few columns wide, nothing crazy. I'm trying to subset the rows based on a list of some 4,000 strings, but am struggling to figure out how to do so. Is there a way to subset by a list like that?

The DataFrame looks something like this:

dog_name    count
===================
Jenny        2
Fido         4
Joey         7
Yeller       2

and the list of strings is contained in the variable dog_name_list = ['Fido', 'Yeller']

I've tried something along the lines of df[df['dog_name'].isin(dog_name_list)], but am getting a fun error: unhashable type: 'list'

I've checked a similar question, the docs and this rundown for subsetting data frames by seeing whether a value is present in a list, but that's got me right about nowhere, and I'm a little confused by what I'm missing. Would really appreciate someone's advice!


asked Feb 11 '16 22:02

scrollex

1 Answer

I believe you have a list in your dog name column.

This works fine:

>>> df[df['dog_name'].isin(['Fido', 'Yeller'])]
  dog_name  count
1     Fido      4
3   Yeller      2
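
For anyone who wants to reproduce the working case, here is a minimal sketch. The way the DataFrame is built is an assumption based on the sample in the question; the real data presumably comes from somewhere else.

```python
import pandas as pd

# Assumed reconstruction of the sample data from the question.
df = pd.DataFrame({
    "dog_name": ["Jenny", "Fido", "Joey", "Yeller"],
    "count": [2, 4, 7, 2],
})
dog_name_list = ["Fido", "Yeller"]

# isin() builds a boolean mask; every value in the column is a plain
# string here, so each one can be hashed and looked up.
mask = df["dog_name"].isin(dog_name_list)
subset = df[mask]
print(subset)
```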

But if you add a list:

>>> df.loc[4] = [['a'], 2]  # .ix was removed in pandas 1.0; .loc does the same job here
>>> df
  dog_name  count
0    Jenny      2
1     Fido      4
2     Joey      7
3   Yeller      2
4      [a]      2

>>> df[df['dog_name'].isin(['Fido', 'Yeller'])]
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-20-1b68dd948f39> in <module>()
----> 1 df[df['dog_name'].isin(['Fido', 'Yeller'])]
...
pandas/lib.pyx in pandas.lib.ismember (pandas/lib.c:5014)()

TypeError: unhashable type: 'list'

To find those bad dogs:

>>> df[[isinstance(dog, list) for dog in df.dog_name]]
  dog_name  count
4      [a]      2

To find all the data types in the column:

>>> set((type(dog) for dog in df.dog_name))
{list, str}
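
If it turns out that only a few rows hold lists, one possible way forward is to tally the types and then restrict the frame to rows whose dog_name is a string before calling isin. This is only a sketch under that assumption: the sample data is rebuilt by hand, and the names is_str, clean, and subset are mine, not from the question. Whether the list rows should be dropped or repaired depends on your data.

```python
import pandas as pd

# Assumed reconstruction of the sample data, including the one bad row.
df = pd.DataFrame({
    "dog_name": ["Jenny", "Fido", "Joey", "Yeller", ["a"]],
    "count": [2, 4, 7, 2, 2],
})
dog_name_list = ["Fido", "Yeller"]

# Tally the Python type of each value in the column, by type name.
type_counts = df["dog_name"].map(lambda v: type(v).__name__).value_counts().to_dict()

# Keep only rows whose dog_name is a plain string (drops the list rows).
is_str = df["dog_name"].map(lambda v: isinstance(v, str))
clean = df[is_str]

# With only hashable strings left in the column, isin() can be applied.
subset = clean[clean["dog_name"].isin(dog_name_list)]
print(type_counts)
print(subset)
```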

answered Sep 19 '22 22:09

Alexander
