Column with list of strings in python

Tags: python, slice, pandas, dataframe

I have a pandas dataframe like the following:

                                          categories  review_count
0                  [Burgers, Fast Food, Restaurants]           137
1                         [Steakhouses, Restaurants]           176
2  [Food, Coffee & Tea, American (New), Restaurants]           390
...                                          ....              ...
...                                          ....              ...
...                                          ....              ...

From this DataFrame, I would like to extract only those rows where the list in the 'categories' column contains the category 'Restaurants'. So far I have tried:

df[[df.categories.isin('Restaurants'),review_count]]

Since the DataFrame also has other columns, I specified the two columns that I want to extract. But I get the error:

TypeError: unhashable type: 'list'

I am very new to pandas and don't really understand what this error means. How can I extract only those rows of the DataFrame whose 'categories' list contains the string 'Restaurants'? Any help would be much appreciated.

Thanks in advance!

761

asked Oct 13 '13 23:10

anonuser0428

1 Answer

I think you may have to use a lambda function for this. isin tests whether each value in your column is a member of some sequence, but pandas doesn't seem to provide a function for the reverse case, testing whether the sequence stored in each row of your column contains some value:

import pandas as pd
categories = [['fast_food', 'restaurant'], ['coffee', 'cafe'], ['burger', 'restaurant']]
counts = [137, 176, 390]
df = pd.DataFrame({'categories': categories, 'review_count': counts})
# Show which rows contain 'restaurant'
df.categories.map(lambda x: 'restaurant' in x)
# Subset the dataframe using this:
df[df.categories.map(lambda x: 'restaurant' in x)]

Output:

Out[11]: 
                categories  review_count
0  [fast_food, restaurant]           137
2     [burger, restaurant]           390
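
Since the question also asks for only two of the columns, here is a minimal sketch of how the same mask could be combined with .loc to pick them. The extra 'city' column is hypothetical and only stands in for the "other columns" mentioned in the question. The second mask is an alternative that avoids the lambda, assuming a pandas version that has Series.explode (0.25 or later):

```python
import pandas as pd

df = pd.DataFrame({
    'categories': [['fast_food', 'restaurant'], ['coffee', 'cafe'], ['burger', 'restaurant']],
    'review_count': [137, 176, 390],
    'city': ['Phoenix', 'Tempe', 'Mesa'],  # hypothetical extra column
})

# Boolean mask: True where the row's list contains 'restaurant'
mask = df['categories'].map(lambda cats: 'restaurant' in cats)

# Keep the matching rows and only the two columns of interest
result = df.loc[mask, ['categories', 'review_count']]
print(result)

# Alternative without a lambda: put each list element on its own row,
# test membership with isin, then collapse back to one value per original row
mask_alt = df['categories'].explode().isin(['restaurant']).groupby(level=0).any()
```

As for the TypeError in the question, it probably comes from isin relying on hashing: the cells here hold Python lists, and lists cannot be hashed.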

130

answered Sep 29 '22 16:09

Marius
