Why can't I use an offset when <code>rolling</code> a multi-index DataFrame? For example, with: <pre class="prettyprint"><code>rng = pd.date_range('2017-01-03', periods=20, freq='8D') i = pd.MultiIndex.from_product([['A','B','C'], rng], names=['Name','Date']) df = pd.DataFrame(np.random.randn(60), i, columns=['Vals']) </code></pre> If I try grouping and rolling with an offset I get "ValueError: window must be an integer": <pre class="prettyprint"><code>df['Avg'] = df.groupby(['Name'])['Vals'].rolling('30D').mean() # << Why doesn't this work? </code></pre> Not that these following variants meet my needs, but note that grouping and rolling with an <code>int</code> works: <pre class="prettyprint"><code>df['Avg'] = df.groupby(['Name'])['Vals'].rolling(4).mean() </code></pre> And I can roll with an offset on a single-index subset of the DataFrame: <pre class="prettyprint"><code>d = df.loc['A'] d['Avg'] = d['Vals'].rolling('30D').mean() </code></pre> If it's truly impossible to do rolling with offsets on multi-index DataFrames, what would be the most efficient workaround to apply one to each level-0 index item?

In order to use an offset like '30D' you need a simple date index. In this case the simplest way to achieve that is to move 'Name' out of the index with <code>reset_index(level='Name')</code>, leaving you with only 'Date' as the index: <pre class="prettyprint"><code>df['Avg'] = df.reset_index(level='Name').groupby(['Name'])['Vals'].rolling('30D').mean() </code></pre>

Pandas MultiIndex DataFrame.rolling offset

Tags:

python

pandas

dataframe

aggregate

multi-index

Why can't I use an offset when rolling a multi-index DataFrame? For example, with:

Click to copy

rng = pd.date_range('2017-01-03', periods=20, freq='8D')
i = pd.MultiIndex.from_product([['A','B','C'], rng], names=['Name','Date'])
df = pd.DataFrame(np.random.randn(60), i, columns=['Vals'])

If I try grouping and rolling with an offset I get "ValueError: window must be an integer":

Click to copy

df['Avg'] = df.groupby(['Name'])['Vals'].rolling('30D').mean() # << Why doesn't this work?

Not that these following variants meet my needs, but note that grouping and rolling with an int works:

Click to copy

df['Avg'] = df.groupby(['Name'])['Vals'].rolling(4).mean()

And I can roll with an offset on a single-index subset of the DataFrame:

Click to copy

d = df.loc['A']
d['Avg'] = d['Vals'].rolling('30D').mean()

If it's truly impossible to do rolling with offsets on multi-index DataFrames, what would be the most efficient workaround to apply one to each level-0 index item?

500

asked Feb 23 '18 21:02

feetwet

1 Answers

In order to use an offset like '30D' you need a simple date index. In this case the simplest way to achieve that is to move 'Name' out of the index with reset_index(level='Name'), leaving you with only 'Date' as the index:

Click to copy

df['Avg'] = df.reset_index(level='Name').groupby(['Name'])['Vals'].rolling('30D').mean()

answered Oct 12 '22 12:10

JohnE

Related questions
                            
                                pandas DataFrame.query expression that returns all rows by default
                            
                                How to uniformly resample a non-uniform signal using SciPy?
                            
                                pygame.mixer.music.play() doesn't recognize Fast Tracker (.xm music format) repeat position
                            
                                PyGILState_Ensure() Causing Deadlock
                            
                                Is installing NodeJS packages locally equivalent to Python's virtualenv?
                            
                                is there a simple way to use features from tf.data.Dataset.from_generator with a custom model_fn(Estimator) in tensorflow
                            
                                Different virtualenv's on one Jupyter notebook
                            
                                Iterate through columns in Read-only workbook in openpyxl
                            
                                Running flask as package in production
                            
                                How to use priority in celery task.apply_async
                            
                                JSONDecodeError using Google Translate API with Python3
                            
                                passing args and kwargs to parent class with extra content in django CreateView
                            
                                Matplotlib animations do not work in PyCharm
                            
                                Numpy isnat() returns value error on datetime objects
                            
                                How to Normalize data with NaN values in python
                            
                                Multiple classification models in a scikit pipeline python
                            
                                How to read parquet file with a condition using pyarrow in Python
                            
                                Better way to compute floor of log(n,b) for integers n and b?
                            
                                Locust: how I share auth cookie with the rest of tasks only for current locust user?
                            
                                Tensorflow `tf.layers.batch_normalization` doesn't add update ops to `tf.GraphKeys.UPDATE_OPS`

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas MultiIndex DataFrame.rolling offset

Tags:

python

pandas

dataframe

aggregate

multi-index

feetwet

People also ask

1 Answers

JohnE

Recent Activity

Donate For Us