Python Pandas: Check if all columns in rows value is NaN

Tags:

Kindly accept my apologies if my question has already been answered. I tried to find a solution but all I can find is to dropna solution for all NaN's in a dataframe. My question is that I have a dataframe with 6 columns and 500 rows. I need to check if in any particular row all the values are NaN so that I can drop them from my dataset. Example below row 2, 6 & 7 contains all Nan from col1 to col6:

    Col1    Col2    Col3    Col4    Col5    Col6
    12      25      02      78      88      90
    Nan     Nan     Nan     Nan     Nan     Nan
    Nan     35      03      11      65      53
    Nan     Nan     Nan     Nan     22      21
    Nan     15      93      111     165     153
    Nan     Nan     Nan     Nan     Nan     Nan
    Nan     Nan     Nan     Nan     Nan     Nan
    141     121     Nan     Nan     Nan     Nan

Please note that top row is just headings and from 2nd row on wards my data starts. Will be grateful if anyone can help me in right direction to solve this puzzle.

And also my 2nd question is that after deleting all Nan in all columns if I want to delete the rows where 4 or 5 columns data is missing then what will be the best solution.

and last question is, is it possible after deleting the rows with most Nan's then how can I create box plot on the remaining for example 450 rows?

Any response will be highly appreciated.

Regards,

754

asked Sep 02 '16 18:09

Baig

1 Answers

For those search because wish to know on the question title:

Check if all columns in rows value is NaN

A simple approach would be:

df[[list_of_cols_to_check]].isnull().apply(lambda x: all(x), axis=1)

import pandas as pd
import numpy as np


df = pd.DataFrame({'movie': [np.nan, 'thg', 'mol', 'mol', 'lob', 'lob'],
                  'rating': [np.nan, 4., 5., np.nan, np.nan, np.nan],
                  'name':   ['John', np.nan, 'N/A', 'Graham', np.nan, np.nan]}) 
df.head()

enter image description here

To check if all columns is NaN:

cols_to_check = df.columns
df['is_na'] = df[cols_to_check].isnull().apply(lambda x: all(x), axis=1) 
df.head()

enter image description here

To check if columns 'name', 'rating' are NaN:

cols_to_check = ['name', 'rating']
df['is_na'] = df[cols_to_check].isnull().apply(lambda x: all(x), axis=1) 
df.head()

enter image description here

152

answered Nov 14 '22 22:11

Wong Tat Yau

Related questions
                            
                                Python library for data scaling, centering and Box-Cox transformation
                            
                                Generate x+xx+xxx+xxxx ... for a given integer (for 4 -> 4+44+444...)
                            
                                AttributeError: 'RegexURLPattern' object has no attribute '_callback'
                            
                                How can I use versioning in S3 with boto3?
                            
                                Can I dynamically pass a list of arguments to a python function with predefined arguments? [duplicate]
                            
                                Django Allauth Error: "'socialaccount' is not a registered tag library"
                            
                                Matplotlib animation inside your own GUI
                            
                                Easiest way to generate random int64 array in numpy?
                            
                                Setting kivy window size not working
                            
                                TypeError: 'numpy.float64' object is not callable?
                            
                                How to get Windows window names with ctypes in python
                            
                                No module named bidi.algorithm
                            
                                Hard limiting / threshold activation function in TensorFlow
                            
                                python lambda list filtering with multiple conditions
                            
                                Add/subtract dataframes with different column labels
                            
                                Why bisect slower than sort
                            
                                print UTF-8 character in Python 2.7
                            
                                output a dataframe to a json array
                            
                                Scrapy Images Downloading
                            
                                LabelEncoder specify classes in DataFrame

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python Pandas: Check if all columns in rows value is NaN

Tags:

python

pandas

nan

Baig

People also ask

1 Answers

Wong Tat Yau

Recent Activity

Donate For Us