Why does pandas roll a week forward when using resample with W-MON frequency?

Tags:

As example, I have the following code which creates a dataframe with an index containing a single value - the date '2018-03-06' (a Tuesday). Note that this date falls in the week of 2018-03-05 (a Monday):

values = [1, 1, 1]
dates = pd.to_datetime(np.repeat('2018-03-06', 3))
df = pd.DataFrame({
    'value': values
}, index=dates)
df.resample('W-MON').size()

which produces:

2018-03-12    3
Freq: W-MON, dtype: int64

Why does pandas roll the date forward one week? I would have expected the result to have been resampled to 2018-03-05 since that is the week during which the values were generated and I'm using freq='W-MON'.

UPDATE

As was pointed out, I needed to add the label argument to resample which defines which bin edge to use. Using label='left' solves the problem of bucketing the dates in the correct week except when the date falls on the start of the week (in this case, Monday). For example, if I apply resample to the date 2018-03-05 using label='left' then the resampled value is 2018-02-26 when it should be 2018-03-05.

280

asked Mar 28 '18 21:03

cdlm

2 Answers

Let's try using label and closed see docs:

values = [1, 1, 1]
dates = pd.to_datetime(np.repeat('2018-03-06', 3))
df = pd.DataFrame({
    'value': values
}, index=dates)
df.resample('W-MON', label='left',closed='left').size()

Output:

2018-03-05    3
Freq: W-MON, dtype: int64

And,

values = [1, 1, 1]
dates = pd.to_datetime(np.repeat('2018-03-05', 3))
df = pd.DataFrame({
    'value': values
}, index=dates)
df.resample('W-MON', label='left',closed='left').size()

Output:

2018-03-05    3
Freq: W-MON, dtype: int64

Interesting note about the docs, the signature states that 'closed' defaults to None. However, the docstring states that 'closed' default 'left'.

114

answered Oct 28 '22 07:10

Scott Boston

I'm not sure why it's done this way and I agree that the behaviour you expected seems more intuitive. You can get your desired result by passing label='left' as a keyword parameter. The default value in this case was 'right'.

df.resample('W-MON', label='left').size()

From the documentation:

label : {‘right’, ‘left’}

Which bin edge label to label bucket with. The default is ‘left’ for all frequency offsets except for ‘M’, ‘A’, ‘Q’, ‘BM’, ‘BA’, ‘BQ’, and ‘W’ which all have a default of ‘right’.

I guess 'W-MON' still counts as 'W' which is why the default is 'right' and therefore your example gave a result of '2018-03-12' rather than '2018-03-05'.

answered Oct 28 '22 05:10

sjw

Related questions
                            
                                Extract class name in scrapy
                            
                                Dask delayed object of unspecified length not iterable error when combining dictionaries
                            
                                How can I call multiple views in one url address in Django?
                            
                                How to set a Tkinter widget to a monospaced, platform independent font?
                            
                                Python and C++ sharing the same memory resources
                            
                                Numpy: Fastest way to insert value into array such that array's in order
                            
                                Jupyter pandas.DataFrame output table format configuration
                            
                                pandas: Replicate / Broadcast single indexed DataFrame on MultiIndex DataFrame: HowTo and Memory Efficiency
                            
                                Is it safe that when Two asyncio tasks access the same awaitable object?
                            
                                Keras - .flow_from_directory(directory)
                            
                                Multinomial Logit model Python and Stata different results
                            
                                TypeError: unsupported operand type(s) for +: 'map' and 'float'
                            
                                Python add two sets and delete duplicate elements
                            
                                Error with opencv clahe.apply()
                            
                                Pyspark- Subquery in a case statement
                            
                                Vectorized dictionary in Python
                            
                                Error Trying to initialize Dash in Spyder IPython Console
                            
                                Why does defining tf.Session with and without context manager in Tensorflow result in different behaviour?
                            
                                How to specify that an attribute must be a list of (say) integers, not just a list?
                            
                                SSL: CERTIFICATE_VERIFY_FAILED error from Python pip in Ubuntu 16.0.4

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why does pandas roll a week forward when using resample with W-MON frequency?

Tags:

python

pandas

cdlm

People also ask

2 Answers

Scott Boston

sjw

Recent Activity

Donate For Us