How to check if a column contains a list

Tags: python, pandas

import pandas as pd

df = pd.DataFrame({"col1": ["a", "b", "c", ["a", "b"]]})

I have a dataframe like this, and I want to find the rows that contain a list in that column. I tried value_counts(), but it took very long and threw an error at the end. Here is the error:

TypeError                                 Traceback (most recent call last)
pandas/_libs/hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.map_locations()

TypeError: unhashable type: 'list'
Exception ignored in: 'pandas._libs.index.IndexEngine._call_map_locations'
Traceback (most recent call last):
  File "pandas/_libs/hashtable_class_helper.pxi", line 1709, in pandas._libs.hashtable.PyObjectHashTable.map_locations
TypeError: unhashable type: 'list'
c         1
a         1
[a, b]    1
b         1
Name: col1, dtype: int64

For bigger dataframes this takes forever.

Here is what the desired output looks like:

col1
c       1
b       1
[a,b]   1
dtype: int64

asked Nov 05 '20 18:11

erentknn

1 Answer

Iterate over the rows and check the type of each object in the column with the condition type(obj) == list:

import pandas as pd

df = pd.DataFrame({"col1": ["a", "b", "c", ["a", "b"]]})

for ind in df.index:
    print(type(df['col1'][ind]) == list)

And here is the result:

False
False
False
True
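
If you want the matching rows themselves rather than printed booleans, one possible variation is to build a boolean mask and index the dataframe with it. This is only a sketch: the names is_list, list_rows, hashable and counts are made up for illustration. The second part converts each list to a tuple so that value_counts() can hash it, which assumes the list elements are hashable themselves.

```python
import pandas as pd

df = pd.DataFrame({"col1": ["a", "b", "c", ["a", "b"]]})

# Boolean mask: True where the cell holds a list
is_list = df["col1"].map(lambda value: isinstance(value, list))

# Keep only the rows whose col1 value is a list
list_rows = df[is_list]

# Lists are unhashable, so turn them into tuples before counting
# (assumption: the elements inside each list are hashable)
hashable = df["col1"].map(
    lambda value: tuple(value) if isinstance(value, list) else value
)
counts = hashable.value_counts()

print(is_list.tolist())  # [False, False, False, True]
print(list_rows)
print(counts)
```

Here isinstance(value, list) is used instead of type(value) == list, so subclasses of list are matched as well. The tuple conversion changes how the value is displayed in the counts, so it may not match the desired output exactly.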

answered Oct 23 '22 03:10

Pouya Esmaeili
