How to count the NaN values in a column in pandas DataFrame

Tags:

I want to find the number of NaN in each column of my data so that I can drop a column if it has fewer NaN than some threshold. I looked but wasn't able to find any function for this. value_counts is too slow for me because most of the values are distinct and I'm only interested in the NaN count.

537

asked Oct 08 '14 21:10

user3799307

1 Answers

You can use the isna() method (or it's alias isnull() which is also compatible with older pandas versions < 0.21.0) and then sum to count the NaN values. For one column:

In [1]: s = pd.Series([1,2,3, np.nan, np.nan])  In [4]: s.isna().sum()   # or s.isnull().sum() for older pandas versions Out[4]: 2

For several columns, it also works:

In [5]: df = pd.DataFrame({'a':[1,2,np.nan], 'b':[np.nan,1,np.nan]})  In [6]: df.isna().sum() Out[6]: a    1 b    2 dtype: int64

139

answered Sep 19 '22 16:09

joris

Related questions
                            
                                Create list of single item repeated N times
                            
                                Does Python's time.time() return the local or UTC timestamp?
                            
                                Filter dict to contain only certain keys?
                            
                                How to calculate number of days between two given dates
                            
                                Add a new item to a dictionary in Python [duplicate]
                            
                                How to urlencode a querystring in Python?
                            
                                ImportError: Cannot import name X
                            
                                Can I force pip to reinstall the current version?
                            
                                TensorFlow not found using pip
                            
                                Split string with multiple delimiters in Python [duplicate]
                            
                                Remove specific characters from a string in Python
                            
                                How do I get indices of N maximum values in a NumPy array?
                            
                                Append integer to beginning of list in Python [duplicate]
                            
                                Unzipping files in Python
                            
                                Saving utf-8 texts with json.dumps as UTF8, not as \u escape sequence
                            
                                How to filter Pandas dataframe using 'in' and 'not in' like in SQL
                            
                                How to make a timezone aware datetime object in Python?
                            
                                Shuffle DataFrame rows
                            
                                Python progression path - From apprentice to guru
                            
                                Set value for particular cell in pandas DataFrame using index

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to count the NaN values in a column in pandas DataFrame

Tags:

python

pandas

dataframe

user3799307

People also ask

1 Answers

joris

Recent Activity

Donate For Us