
How do I convert strings in a Pandas data frame to a 'date' data type?

Tags: python, date, pandas

Essentially equivalent to @waitingkuo's answer, but I would use pd.to_datetime here (it seems a little cleaner and offers some additional functionality, e.g. dayfirst):

In [11]: df
Out[11]:
   a        time
0  1  2013-01-01
1  2  2013-01-02
2  3  2013-01-03

In [12]: pd.to_datetime(df['time'])
Out[12]:
0   2013-01-01 00:00:00
1   2013-01-02 00:00:00
2   2013-01-03 00:00:00
Name: time, dtype: datetime64[ns]

In [13]: df['time'] = pd.to_datetime(df['time'])

In [14]: df
Out[14]:
   a                time
0  1 2013-01-01 00:00:00
1  2 2013-01-02 00:00:00
2  3 2013-01-03 00:00:00
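The dayfirst option mentioned above matters when the strings are ambiguous. Here is a minimal sketch, assuming day/month/year strings separated by slashes (the sample values are made up for illustration). Passing an explicit format is another option if you know the layout in advance.

```python
import pandas as pd

# Hypothetical sample data: these strings could be read as month/day or day/month.
s = pd.Series(["01/02/2013", "03/02/2013"])

# Default parsing treats the first number as the month.
month_first = pd.to_datetime(s)

# dayfirst=True treats the first number as the day.
day_first = pd.to_datetime(s, dayfirst=True)

# An explicit format removes the ambiguity entirely.
explicit = pd.to_datetime(s, format="%d/%m/%Y")

print(month_first.dt.month.tolist())
print(day_first.dt.month.tolist())
```

If you know the exact layout, the explicit format is probably the safer choice, since it does not depend on how the parser guesses.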

Handling ValueErrors
If you run into a situation where

df['time'] = pd.to_datetime(df['time'])

throws a

ValueError: Unknown string format

that means you have invalid (non-coercible) values. If you are okay with having them converted to pd.NaT, you can add an errors='coerce' argument to to_datetime:

df['time'] = pd.to_datetime(df['time'], errors='coerce')
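Before overwriting the column, it can help to see which rows would be turned into NaT. This is a rough sketch, assuming a frame shaped like the one above with one deliberately broken value; the names parsed and bad_rows are just for illustration.

```python
import pandas as pd

# Hypothetical sample frame with one value that cannot be parsed as a date.
df = pd.DataFrame({
    "a": [1, 2, 3],
    "time": ["2013-01-01", "not a date", "2013-01-03"],
})

# Invalid values become NaT instead of raising ValueError.
parsed = pd.to_datetime(df["time"], errors="coerce")

# Rows that had a value originally but failed to parse.
bad_rows = df[parsed.isna() & df["time"].notna()]
print(bad_rows)

# Only now replace the original strings with the parsed datetimes.
df["time"] = parsed
```

Inspecting bad_rows first lets you decide whether to fix the source data rather than silently dropping it.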

Alternatively, use astype:

In [31]: df
Out[31]: 
   a        time
0  1  2013-01-01
1  2  2013-01-02
2  3  2013-01-03

In [32]: df['time'] = df['time'].astype('datetime64[ns]')

In [33]: df
Out[33]: 
   a                time
0  1 2013-01-01 00:00:00
1  2 2013-01-02 00:00:00
2  3 2013-01-03 00:00:00
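For clean ISO-formatted strings, astype and pd.to_datetime should give the same values. A small sketch to check that, assuming the same sample frame as above (note that astype has no equivalent of errors='coerce', so it is less forgiving with messy data):

```python
import pandas as pd

# Same sample frame as in the session above.
df = pd.DataFrame({
    "a": [1, 2, 3],
    "time": ["2013-01-01", "2013-01-02", "2013-01-03"],
})

# Convert the string column both ways.
via_astype = df["time"].astype("datetime64[ns]")
via_to_datetime = pd.to_datetime(df["time"])

# Compare the resulting values element by element.
same_values = bool((via_astype == via_to_datetime).all())
print(same_values)
```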

I imagine a lot of data comes into Pandas from CSV files, in which case you can simply convert the date during the initial CSV read:

dfcsv = pd.read_csv('xyz.csv', parse_dates=[0])

where the 0 refers to the position of the column the date is in. You could also add index_col=0 to the call if you want the date to be your index.

See https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_csv.html
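To try the read_csv approach without a file on disk, here is a self-contained sketch that reads from an in-memory string instead of 'xyz.csv'. The CSV contents and column names are assumptions made up for the example.

```python
import io

import pandas as pd

# Hypothetical CSV contents; the date is in the first column (position 0).
csv_text = "time,a\n2013-01-01,1\n2013-01-02,2\n2013-01-03,3\n"

# parse_dates=[0] parses column 0 as datetimes; index_col=0 makes it the index.
dfcsv = pd.read_csv(io.StringIO(csv_text), parse_dates=[0], index_col=0)

print(type(dfcsv.index).__name__)
print(dfcsv)
```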

Once the column has a datetime dtype, you can use the .dt accessor on it, e.g. df['column'].dt.date

Note that for datetime columns, if you don't see the time when the values are all 00:00:00, nothing is wrong with the data. The display is simply omitting the midnight time to make things look pretty; the column's dtype is still datetime64[ns].

If you want to get the DATE and not DATETIME format:

df["id_date"] = pd.to_datetime(df["id_date"]).dt.date
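One thing to be aware of: .dt.date gives you plain Python datetime.date objects in an object-dtype column, so the .dt accessor no longer works on the result. If you only want to drop the time of day but keep a datetime column, .dt.normalize() may be a better fit. A small sketch, with made-up sample values:

```python
import datetime

import pandas as pd

# Hypothetical sample data with a time-of-day component.
df = pd.DataFrame({"id_date": ["2013-01-01 10:30:00", "2013-01-02 23:15:00"]})

parsed = pd.to_datetime(df["id_date"])

# Python date objects, stored in an object-dtype column.
dates = parsed.dt.date

# Still a datetime column, with the time reset to midnight.
normalized = parsed.dt.normalize()

print(dates.dtype)
print(normalized.dtype)
```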
