I've got a dataset with a large number of rows. Some of the values are NaN, like this:
In [91]: df
Out[91]:
   0  1    2    3    4
0  1  3    1    1    1
1  1  3    1    1    1
2  2  3    1    1    1
3  1  1  NaN  NaN  NaN
4  1  3    1    1    1
5  1  1    1    1    1
And I want to count the number of NaN values in each row, like this:
In [91]: list = <somecode with df>
In [92]: list
Out[92]: [0, 0, 0, 3, 0, 0]
What is the best and fastest way to do it?
Since sum() counts True as 1 and False as 0, you can count the number of missing values in each row and column by calling sum() on the result of isnull(). By default it counts missing values in each column; with axis=1 it counts them in each row.
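Here is a minimal, self-contained sketch of both directions (the sample values mirror the frame in the question; the default integer labels are an assumption):

import numpy as np
import pandas as pd

# Rebuild a small frame like the one in the question
df = pd.DataFrame([
    [1, 3, 1, 1, 1],
    [1, 3, 1, 1, 1],
    [2, 3, 1, 1, 1],
    [1, 1, np.nan, np.nan, np.nan],
    [1, 3, 1, 1, 1],
    [1, 1, 1, 1, 1],
])

# Column-wise NaN counts (the default axis)
print(df.isnull().sum())

# Row-wise NaN counts: 0, 0, 0, 3, 0, 0
print(df.isnull().sum(axis=1))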
The count() method directly gives the count of non-NaN values in each column. So we can get the count of NaN values if we know the total number of observations.
df.isnull().sum() will give the column-wise sum of missing values.
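As a quick sketch of the count()-based idea for columns (the small frame here is made up purely for illustration):

import numpy as np
import pandas as pd

df = pd.DataFrame([[1, np.nan, 2], [np.nan, np.nan, 3], [4, 5, 6]])

# count() returns the number of non-NaN values per column,
# so subtracting from the total number of rows gives the NaN count
print(len(df) - df.count())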
You could first find whether each element is NaN or not with isnull(), and then take a row-wise sum(axis=1):
In [195]: df.isnull().sum(axis=1)
Out[195]:
0    0
1    0
2    0
3    3
4    0
5    0
dtype: int64
And if you want the output as a list, you can do:
In [196]: df.isnull().sum(axis=1).tolist()
Out[196]: [0, 0, 0, 3, 0, 0]
Or use count(), like:
In [130]: df.shape[1] - df.count(axis=1)
Out[130]:
0    0
1    0
2    0
3    3
4    0
5    0
dtype: int64
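Since the question also asks about speed, one way to compare the two approaches is a rough timeit run; the frame size, NaN fraction, and repeat count below are arbitrary choices, and actual timings will vary with your machine and pandas version:

import timeit

import numpy as np
import pandas as pd

# Synthetic frame with roughly 10% NaNs, purely for benchmarking
rng = np.random.default_rng(0)
data = rng.random((100_000, 5))
data[rng.random(data.shape) < 0.1] = np.nan
df = pd.DataFrame(data)

print(timeit.timeit(lambda: df.isnull().sum(axis=1), number=100))
print(timeit.timeit(lambda: df.shape[1] - df.count(axis=1), number=100))

Both approaches are vectorized, so the difference is usually small; df.isnull().sum(axis=1) tends to be the more readable choice.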