How to test for nan's in an apply function in pandas?

Tags:

I have a simple apply function that I execute on some of the columns. But, it keeps getting tripped up by NaN values in pandas.

input_data = np.array(
[
[random.randint(0,9) for x in range(2)]+['']+['g'],
[random.randint(0,9) for x in range(3)]+['g'],
[random.randint(0,9) for x in range(3)]+['a'],
[random.randint(0,9) for x in range(3)]+['b'],
[random.randint(0,9) for x in range(3)]+['b']
]
)

input_df = pd.DataFrame(data=input_data, columns=['B', 'C', 'D', 'label'])

I have a simple lambda like this:

input_df['D'].apply(lambda aCode: re.sub('\.', '', aCode) if not np.isnan(aCode) else aCode)

And it gets tripped up by the NaN values:

File "<pyshell#460>", line 1, in <lambda>
    input_df['D'].apply(lambda aCode: re.sub('\.', '', aCode) if not np.isnan(aCode) else aCode)
TypeError: Not implemented for this type

So, I tried just testing for nan values that Pandas adds:

np.isnan(input_df['D'].values[0])
np.isnan(input_df['D'].iloc[0])

Both get the same error.

I do not know how to test for nan values other than np.isnan. Is there an easier way to do this? Thanks.

751

asked Feb 05 '16 20:02

makansij

Video Answer

1 Answers

your code fails because your first entry is an empty string and np.isnan doesn't understand empty strings:

In [55]:
input_df['D'].iloc[0]

Out[55]:
''

In [56]:
np.isnan('')

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-56-a9f139a0c5b8> in <module>()
----> 1 np.isnan('')

TypeError: Not implemented for this type

ps.notnull does work:

In [57]:
import re
input_df['D'].apply(lambda aCode: re.sub('\.', '', aCode) if pd.notnull(aCode) else aCode)

Out[57]:
0     
1    3
2    3
3    0
4    3
Name: D, dtype: object

However, if you just want to replace something then just use .str.replace:

In [58]:
input_df['D'].str.replace('\.','')

Out[58]:
0     
1    3
2    3
3    0
4    3
Name: D, dtype: object

188

answered Oct 02 '22 23:10

EdChum

Related questions
                            
                                Logging using elasticsearch-py
                            
                                Grouping by everything except for one index column in pandas
                            
                                setup.py doesn't see my requirements.txt
                            
                                Sqlite python sqlite3.OperationalError: database is locked
                            
                                Using boto for AWS S3 Buckets for Signature V4
                            
                                Deterministic python script behaves in non-deterministic way
                            
                                Dynamically add to list of what Python asyncio's event loop should execute
                            
                                pandas save date in ISO format?
                            
                                django-rest-framework HyperlinkedIdentityField with multiple lookup args
                            
                                Install mysql in dockerfile?
                            
                                Geo Django get cities from latitude and longitude
                            
                                Why re.escape escapes space
                            
                                'n' in pdb moves me inside of the pdb.set_trace() method
                            
                                windows7 64bit python pip install error: Unable to find vcvarsall.bat
                            
                                Python: Check if a /dev/disk device exists
                            
                                Scrape tweets by tweet location and user location
                            
                                How to correctly convert MIDI ticks to milliseconds?
                            
                                Django - change field validation message
                            
                                Using HTML5 fields with WTForms
                            
                                How to replace only the first n elements in a numpy array that are larger than a certain value?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to test for nan's in an apply function in pandas?

Tags:

python

pandas

dataframe

nan

makansij

People also ask

Video Answer

1 Answers

EdChum

Recent Activity

Donate For Us