Time-weighted average with Pandas

Tags:

What's the most efficient way to calculate the time-weighted average of a TimeSeries in Pandas 0.8? For example, say I want the time-weighted average of df.y - df.x as created below:

Click to copy

import pandas
import numpy as np
times = np.datetime64('2012-05-31 14:00') + np.timedelta64(1, 'ms') * np.cumsum(10**3 * np.random.exponential(size=10**6))
x = np.random.normal(size=10**6)
y = np.random.normal(size=10**6)
df = pandas.DataFrame({'x': x, 'y': y}, index=times)

I feel like this operation should be very easy to do, but everything I've tried involves several messy and slow type conversions.

741

asked May 31 '12 19:05

user2303

1 Answers

You can convert df.index to integers and use that to compute the average. There is a shortcut asi8 property that returns an array of int64 values:

Click to copy

np.average(df.y - df.x, weights=df.index.asi8)

answered Oct 18 '22 22:10

Wes McKinney

Related questions
                            
                                Reliable way to only get the email text, excluding previous emails
                            
                                python: os.path.isdir return false for directory with dot on end
                            
                                Caching of (fake) static content which is actually dynamic on GAE for Python
                            
                                How to prevent embedded python to exit() my process
                            
                                libusb-1.x VS openUsb
                            
                                Extracting paragraph breaks from OCR text?
                            
                                Python replacements for RVM/Bundler/Capistrano
                            
                                Buffers and Memoryview Objects explained for the non-C programmer
                            
                                sphinx.ext.autodoc: Keeping names of constants in signature
                            
                                Controlling the tracker when using twinx
                            
                                Why do indented explicit line continuations not allow comments in Python?
                            
                                django: select_related() on an already-existing object?
                            
                                Python IndentationError: too many levels of indentation
                            
                                Importing MonkeyRunner into Python script fails in Windows
                            
                                Urllib.urlopen() works on SSLv3 urls with Python 2.6.6 on 1 machine, but not with 2.6.7/2.7.2 on another
                            
                                .coveragerc file location when running py.test
                            
                                Wrapping C function in Cython and NumPy
                            
                                Git pre-commit hook: getting list of changed files
                            
                                How to use Mathematica functions in Python programs? [closed]
                            
                                PayPal IPN POST request encoding

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Time-weighted average with Pandas

Tags:

python

pandas

time-series

user2303

People also ask

1 Answers

Wes McKinney

Recent Activity

Donate For Us