Date ranges in Pandas

After fighting with NumPy and dateutil for days, I recently discovered the amazing Pandas library. I've been poring through the documentation and source code, but I can't figure out how to get date_range() to generate indices at the right breakpoints.

    from datetime import date
    import pandas as pd

    start = date(2012, 1, 15)
    end = date(2012, 9, 20)
    # 'M' is month-end, instead I need same-day-of-month
    pd.date_range(start, end, freq='M')

What I want:

    2012-01-15
    2012-02-15
    2012-03-15
    ...
    2012-09-15

What I get:

    2012-01-31
    2012-02-29
    2012-03-31
    ...
    2012-08-31

I need month-sized chunks that account for the variable number of days in a month. This is possible with dateutil.rrule:

    from dateutil.rrule import rrule, MONTHLY
    rrule(freq=MONTHLY, dtstart=start, bymonthday=(start.day, -1), bysetpos=1)

Ugly and illegible, but it works. How can I do this with pandas? I've played with both date_range() and period_range(), so far with no luck.

My actual goal is to use groupby, crosstab and/or resample to calculate values for each period based on sums/means/etc of individual entries within the period. In other words, I want to transform data from:

                      total
    2012-01-10 00:01     50
    2012-01-15 01:01     55
    2012-03-11 00:01     60
    2012-04-28 00:01     80

    # Hypothetical usage
    dataframe.resample('total', how='sum', freq='M', start='2012-01-09', end='2012-04-15')

to

                total
    2012-01-09    105  # Values summed
    2012-02-09      0  # Missing from dataframe
    2012-03-09     60
    2012-04-09      0  # Data past end date, not counted

Given that Pandas originated as a financial analysis tool, I'm virtually certain that there's a simple and fast way to do this. Help appreciated!

asked Nov 18 '12 22:11

knite

2 Answers

freq='M' is for month-end frequencies (see here). But you can use .shift to shift it by any number of days (or any frequency for that matter):

    pd.date_range(start, end, freq='M').shift(15, freq=pd.datetools.day)
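In more recent pandas releases pd.datetools is no longer available, so the same idea needs a different spelling. Below is a minimal sketch, assuming a current pandas version. The MonthEnd offset object and the "D" alias stand in for 'M' and pd.datetools.day, and the variable names are only illustrative:

```python
import pandas as pd

start = pd.Timestamp("2012-01-15")
end = pd.Timestamp("2012-09-20")

# Month-end dates that fall inside [start, end]
month_ends = pd.date_range(start, end, freq=pd.offsets.MonthEnd())

# Move every month-end forward by 15 calendar days
shifted = month_ends.shift(15, freq="D")

print(shifted.strftime("%Y-%m-%d").tolist())
```

Each month-end plus 15 days lands on the 15th of the following month, so this range begins at 2012-02-15 rather than 2012-01-15. Starting the month-end range one month earlier would be one way to cover the first period.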

answered Dec 10 '22 10:12

Matti John

There actually is no "day of month" frequency (e.g. "DOMXX" like "DOM09"), but I don't see any reason not to add one.

http://github.com/pydata/pandas/issues/2289
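Even without a dedicated "day of month" alias, a generic one-month step can produce the breakpoints from the question. This is a sketch that assumes a current pandas release, where date_range accepts a pd.DateOffset object as freq. It is not the feature proposed in the issue above, and the variable names are only illustrative:

```python
import pandas as pd

start = pd.Timestamp("2012-01-15")
end = pd.Timestamp("2012-09-20")

# Step one calendar month at a time, starting from the given day of month
same_day = pd.date_range(start, end, freq=pd.DateOffset(months=1))

print(same_day.strftime("%Y-%m-%d").tolist())
```

Start days near the end of a month (29 to 31) may not behave the same way, because shorter months force the date to be clipped, so that case is worth checking separately.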

I don't have a simple workaround for you at the moment because resample requires passing a known frequency rule. I think it should be augmented to be able to take any date range to be used as arbitrary bin edges, also. Just a matter of time and hacking...
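Until resample accepts arbitrary bin edges, one possible way to approximate that behaviour is to build the edges by hand and group with pd.cut. The sketch below assumes a current pandas release and reuses the sample data from the question. It is not the resample extension described above, and names such as edges and in_range are illustrative:

```python
import pandas as pd

# Sample data from the question
stamps = pd.to_datetime(
    ["2012-01-10 00:01", "2012-01-15 01:01", "2012-03-11 00:01", "2012-04-28 00:01"]
)
df = pd.DataFrame({"total": [50, 55, 60, 80]}, index=stamps)

# Bin edges on the 9th of each month; the last edge closes the final bin
edges = pd.date_range("2012-01-09", "2012-05-09", freq=pd.DateOffset(months=1))

# Drop rows past the requested end date before binning
end = pd.Timestamp("2012-04-15")
in_range = df[df.index <= end]

# Assign each row to a left-closed bin [edge_i, edge_i+1)
bins = pd.cut(in_range.index.to_series(), bins=edges, right=False)

# Sum per bin; observed=False keeps empty bins, which sum to 0
result = in_range["total"].groupby(bins, observed=False).sum()
result.index = edges[:-1]  # label each bin by its left edge

print(result)
```

Filtering on the end date first keeps the 2012-04-28 row out of the last bin, which matches the "data past end date, not counted" row in the question.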

answered Dec 10 '22 11:12

Wes McKinney
