Count NaNs when unicode values present

Tags:

Good morning all,

I have a pandas dataframe containing multiple series. For a given series within the dataframe, the datatypes are unicode, NaN, and int/float. I want to determine the number of NaNs in the series but cannot use the built in numpy.isnan method because it cannot safely cast unicode data into a format it can interpret. I have proposed a work around, but I'm wondering if there is a better/more Pythonic way of accomplishing this task.

Thanks in advance, Myles

import pandas as pd
import numpy as np

test = pd.Series(data = [NaN, 2, u'string'])
np.isnan(test).sum()
#Error

#Work around
test2 = [x for x in test if not(isinstance(x, unicode))]
numNaNs = np.isnan(test2).sum()

998

asked Feb 26 '14 13:02

Myles Baker

1 Answers

Use pandas.isnull:

In [24]: test = pd.Series(data = [NaN, 2, u'string'])

In [25]: pd.isnull(test)
Out[25]: 
0     True
1    False
2    False
dtype: bool

Note however, that pd.isnull also regards None as True:

In [28]: pd.isnull([NaN, 2, u'string', None])
Out[28]: array([ True, False, False,  True], dtype=bool)

154

answered Sep 20 '22 18:09

unutbu

Related questions
                            
                                What is the equivalent of psycopg curs.mogrify on mysql?
                            
                                Error in SQLAlchemy with Integer: "object() takes no parameters"
                            
                                functools.partial and generators
                            
                                While generating all possible combinations itertools.combinations_with_replacement() vs itertools.product()?
                            
                                Appending to a DataFrame converts dtypes
                            
                                How to find set of most frequently occurring word-pairs in a file using python?
                            
                                Is there a bug in binning in matplotlib histograms? Or non-randomness of the rvs method in scipy.stats
                            
                                Change color implicit plot
                            
                                Python List in a For Loop
                            
                                Efficiently set row in SciPy sparse.lil_matrix?
                            
                                Is there a foolproof way to give the system enough time to delete a folder before running copytree
                            
                                Return a dict object from Jinja2 macros
                            
                                Expected string or buffer (in re.sub)
                            
                                Python - The Standard Library - ascii( ) Function
                            
                                Access the response object in a bottlepy after_request hook
                            
                                Regression using PYMC3
                            
                                What Series method replaced searchsorted?
                            
                                IRC bot in python won't send messages
                            
                                Combine groups after iteration
                            
                                Finding all possible substrings within a string. Python Regex

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Count NaNs when unicode values present

Tags:

python

pandas

python-unicode

nan

numpy

Myles Baker

People also ask

1 Answers

unutbu

Recent Activity

Donate For Us