Find empty or NaN entry in Pandas Dataframe

Tags:

I am trying to search through a Pandas Dataframe to find where it has a missing entry or a NaN entry.

Here is a dataframe that I am working with:

cl_id       a           c         d         e        A1              A2             A3     0       1   -0.419279  0.843832 -0.530827    text76        1.537177      -0.271042     1       2    0.581566  2.257544  0.440485    dafN_6        0.144228       2.362259     2       3   -1.259333  1.074986  1.834653    system                       1.100353     3       4   -1.279785  0.272977  0.197011     Fifty       -0.031721       1.434273     4       5    0.578348  0.595515  0.553483   channel        0.640708       0.649132     5       6   -1.549588 -0.198588  0.373476     audio       -0.508501                    6       7    0.172863  1.874987  1.405923    Twenty             NaN            NaN     7       8   -0.149630 -0.502117  0.315323  file_max             NaN            NaN

NOTE: The blank entries are empty strings - this is because there was no alphanumeric content in the file that the dataframe came from.

If I have this dataframe, how can I find a list with the indexes where the NaN or blank entry occurs?

842

asked Nov 26 '14 21:11

edesz

1 Answers

np.where(pd.isnull(df)) returns the row and column indices where the value is NaN:

In [152]: import numpy as np In [153]: import pandas as pd In [154]: np.where(pd.isnull(df)) Out[154]: (array([2, 5, 6, 6, 7, 7]), array([7, 7, 6, 7, 6, 7]))  In [155]: df.iloc[2,7] Out[155]: nan  In [160]: [df.iloc[i,j] for i,j in zip(*np.where(pd.isnull(df)))] Out[160]: [nan, nan, nan, nan, nan, nan]

Finding values which are empty strings could be done with applymap:

In [182]: np.where(df.applymap(lambda x: x == '')) Out[182]: (array([5]), array([7]))

Note that using applymap requires calling a Python function once for each cell of the DataFrame. That could be slow for a large DataFrame, so it would be better if you could arrange for all the blank cells to contain NaN instead so you could use pd.isnull.

answered Sep 23 '22 10:09

unutbu

Related questions
                            
                                li:before{ content: "■"; } How to Encode this Special Character as a Bullit in an Email Stationery?
                            
                                List all devices, partitions and volumes in Powershell
                            
                                iterating over two values of a list at a time in python [duplicate]
                            
                                How to iterate over the first n elements of a list?
                            
                                find out the elements of an arraylist which is not present in another arraylist
                            
                                Generate all combinations from multiple lists
                            
                                Convert DataSet to List
                            
                                Getting a ConcurrentModificationException thrown when removing an element from a java.util.List during list iteration? [duplicate]
                            
                                Is list::size() really O(n)?
                            
                                type 'List<dynamic>' is not a subtype of type 'List<int>' where
                            
                                How can I exclude multiple folders using Get-ChildItem -exclude?
                            
                                How to get the closest number from a List<int> with LINQ?
                            
                                Select method in List<t> Collection
                            
                                Saving lists to txt file
                            
                                How to check if a list is ordered?
                            
                                How to get a distinct list from a List of objects?
                            
                                Is there a class like Dictionary<> in C#, but for just keys, no values?
                            
                                List Directories and get the name of the Directory
                            
                                Merge multiple column values into one column in python pandas
                            
                                Using any() and all() to check if a list contains one set of values or another

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Find empty or NaN entry in Pandas Dataframe

Tags:

list

indexing

pandas

dataframe

python-2.7

edesz

People also ask

1 Answers

unutbu

Recent Activity

Donate For Us