How to determine whether a column/variable is numeric or not in Pandas/NumPy?

2 Answers

In pandas 0.20.2 you can do:

import pandas as pd from pandas.api.types import is_string_dtype from pandas.api.types import is_numeric_dtype  df = pd.DataFrame({'A': ['a', 'b', 'c'], 'B': [1.0, 2.0, 3.0]})  is_string_dtype(df['A']) >>>> True  is_numeric_dtype(df['B']) >>>> True

127

answered Sep 24 '22 20:09

danthelion

You can use np.issubdtype to check if the dtype is a sub dtype of np.number. Examples:

np.issubdtype(arr.dtype, np.number)  # where arr is a numpy array np.issubdtype(df['X'].dtype, np.number)  # where df['X'] is a pandas Series

This works for numpy's dtypes but fails for pandas specific types like pd.Categorical as Thomas noted. If you are using categoricals is_numeric_dtype function from pandas is a better alternative than np.issubdtype.

df = pd.DataFrame({'A': [1, 2, 3], 'B': [1.0, 2.0, 3.0],                     'C': [1j, 2j, 3j], 'D': ['a', 'b', 'c']}) df Out:     A    B   C  D 0  1  1.0  1j  a 1  2  2.0  2j  b 2  3  3.0  3j  c  df.dtypes Out:  A         int64 B       float64 C    complex128 D        object dtype: object

np.issubdtype(df['A'].dtype, np.number) Out: True  np.issubdtype(df['B'].dtype, np.number) Out: True  np.issubdtype(df['C'].dtype, np.number) Out: True  np.issubdtype(df['D'].dtype, np.number) Out: False

For multiple columns you can use np.vectorize:

is_number = np.vectorize(lambda x: np.issubdtype(x, np.number)) is_number(df.dtypes) Out: array([ True,  True,  True, False], dtype=bool)

And for selection, pandas now has select_dtypes:

df.select_dtypes(include=[np.number]) Out:     A    B   C 0  1  1.0  1j 1  2  2.0  2j 2  3  3.0  3j

answered Sep 25 '22 20:09

ayhan

Related questions
                            
                                Why does csvwriter.writerow() put a comma after each character?
                            
                                Does Python have a toString() equivalent, and can I convert a class to String?
                            
                                Save list of DataFrames to multisheet Excel spreadsheet
                            
                                python selenium click on button
                            
                                Link to Flask static files with url_for
                            
                                Python Unicode Encode Error
                            
                                How do I get rid of the b-prefix in a string in python?
                            
                                Why do tuples take less space in memory than lists?
                            
                                Inheritance and init method in Python
                            
                                Copy multiple files in Python
                            
                                How to determine whether a substring is in a different string [duplicate]
                            
                                Call Python script from bash with argument
                            
                                What does hash do in python?
                            
                                Construct pandas DataFrame from items in nested dictionary
                            
                                Pandas split DataFrame by column value
                            
                                What is the id( ) function used for?
                            
                                How to calculate probability in a normal distribution given mean & standard deviation?
                            
                                Add Text on Image using PIL
                            
                                Print new output on same line [duplicate]
                            
                                bash: mkvirtualenv: command not found

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to determine whether a column/variable is numeric or not in Pandas/NumPy?

Tags:

python

pandas

numpy

user2808117

People also ask

2 Answers

danthelion

ayhan

Recent Activity

Donate For Us