Consider the following made-up CSV:
from io import StringIO
data = """value,date
7,null
7,10/18/2008
621,(null)"""
fake_file = StringIO(data)
I want to read this file using pandas.read_csv, handling nulls with the na_values parameter and dates with parse_dates and date_parser:
import pandas as pd

date_parser = lambda c: pd.datetime.strptime(c, '%m/%d/%Y')
df = pd.read_csv(fake_file,
                 parse_dates=['date'],
                 date_parser=date_parser,
                 na_values=['null', '(null)'])
Running this code in Python 3.5 gives me this:
File "<ipython-input-11-aa5bcf0858b7>", line 1, in <lambda>
date_parser = lambda c: pd.datetime.strptime(c, DATE_FMT)
TypeError: strptime() argument 1 must be str, not float
So it seems the na_values substitution happens first, turning the null markers into NaN floats, and only then is the date parser applied to the column...
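One way around that ordering is a parser that maps non-strings to NaT, since by the time date_parser runs the null markers are already floats. This is my own sketch, not part of the original question, and it assumes a pandas version that still accepts date_parser (it was deprecated in pandas 2.0):

from io import StringIO
from datetime import datetime
import pandas as pd

data = """value,date
7,null
7,10/18/2008
621,(null)"""

def safe_parser(values):
    # date_parser may be handed the whole column, so parse element-wise;
    # the NaN floats produced by na_values become NaT instead of crashing strptime.
    return pd.Series(values).apply(
        lambda v: datetime.strptime(v, '%m/%d/%Y') if isinstance(v, str) else pd.NaT
    )

df = pd.read_csv(StringIO(data),
                 parse_dates=['date'],
                 date_parser=safe_parser,
                 na_values=['null', '(null)'])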
I know I can do this:
df = pd.read_csv(fake_file,
                 na_values=['null', '(null)'])
df['date'] = pd.to_datetime(df['date'],
                            format='%m/%d/%Y')
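For reference, here is a self-contained run of that two-step route (my own check, not part of the original question). to_datetime maps the NaN floats to NaT even with an explicit format, so the column ends up as a proper datetime64:

from io import StringIO
import pandas as pd

data = """value,date
7,null
7,10/18/2008
621,(null)"""

df = pd.read_csv(StringIO(data), na_values=['null', '(null)'])
df['date'] = pd.to_datetime(df['date'], format='%m/%d/%Y')
print(df.dtypes)  # value: int64, date: datetime64[ns]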
But my real question is how to handle both date parsing and NaN handling in one fell swoop...
By default, date columns come back as object dtype when loading data from a CSV file. The parse_dates argument tells read_csv which columns to parse as dates, and date_parser is the function used to convert those string columns to datetime instances (the default uses dateutil.parser.parser). Two related options: keep_date_col, when True and parse_dates combines multiple columns, keeps the original columns as well; and index_col selects which column (by name or position) to use as the DataFrame's index. The default for index_col is None, in which case pandas generates a fresh 0-based index.
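A small illustration of those options (my own sketch, with made-up null-free data): the default dateutil-based parser handles the m/d/Y strings without a custom date_parser, and index_col promotes the parsed column to a DatetimeIndex:

from io import StringIO
import pandas as pd

data = """value,date
7,10/18/2008
621,12/01/2008"""

# parse_dates with the default parser; index_col makes the parsed
# column the DataFrame's index.
df = pd.read_csv(StringIO(data), parse_dates=['date'], index_col='date')
print(df.index)  # DatetimeIndex(['2008-10-18', '2008-12-01'], ...)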
Use to_datetime with format and errors='coerce':
date_parser = lambda c: pd.to_datetime(c, format='%m/%d/%Y', errors='coerce')
df = pd.read_csv(fake_file, parse_dates=['date'], date_parser=date_parser)
print(df)
   value       date
0      7        NaT
1      7 2008-10-18
2    621        NaT
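A follow-up note: on pandas 2.0+, date_parser is deprecated in favor of the date_format argument, which gives the same one-pass result. A sketch, re-creating the buffer since the earlier reads consume it:

from io import StringIO
import pandas as pd

data = """value,date
7,null
7,10/18/2008
621,(null)"""

# pandas >= 2.0: date_format replaces date_parser; the null markers
# that na_values turned into NaN come through as NaT.
df = pd.read_csv(StringIO(data),
                 parse_dates=['date'],
                 date_format='%m/%d/%Y',
                 na_values=['null', '(null)'])
print(df)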