Faster way to groupby time of day in pandas

I have a time series of several days of 1-minute data, and would like to average it across all days by time of day.

This is very slow:

from datetime import datetime
from numpy.random import randn
from pandas import date_range, Series
time_ind = date_range(datetime(2013, 1, 1), datetime(2013, 1, 10), freq='1min')
all_data = Series(randn(len(time_ind)), time_ind)
time_mean = all_data.groupby(lambda x: x.time()).mean()

This takes almost a minute to run!

While something like:

time_mean = all_data.groupby(lambda x: x.minute).mean()

takes only a fraction of a second.

Is there a faster way to group by time of day?

Any idea why this is so slow?

777

asked Jun 25 '13 03:06

joeb1415

1 Answer

Both your "lambda version" and the time property introduced in version 0.11 seem to be slow in version 0.11.0:

In [4]: %timeit all_data.groupby(all_data.index.time).mean()
1 loops, best of 3: 11.8 s per loop

In [5]: %timeit all_data.groupby(lambda x: x.time()).mean()
Exception RuntimeError: 'maximum recursion depth exceeded while calling a Python object' in <type 'exceptions.RuntimeError'> ignored
Exception RuntimeError: 'maximum recursion depth exceeded while calling a Python object' in <type 'exceptions.RuntimeError'> ignored
Exception RuntimeError: 'maximum recursion depth exceeded while calling a Python object' in <type 'exceptions.RuntimeError'> ignored
1 loops, best of 3: 11.8 s per loop
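
For reference, the two groupings timed above can be written as a standalone script. This is only a sketch: it assumes a recent pandas and NumPy (not the 0.11.0 release discussed here), and the names `by_property` and `by_lambda` are illustrative rather than taken from the question.

```python
import numpy as np
import pandas as pd

# Nine days of 1-minute data plus one final midnight sample, as in the question.
time_ind = pd.date_range("2013-01-01", "2013-01-10", freq="1min")
all_data = pd.Series(np.random.randn(len(time_ind)), index=time_ind)

# Array key: index.time is an object array of datetime.time values, one per row.
by_property = all_data.groupby(all_data.index.time).mean()

# Callable key: pandas calls the function once per index label.
by_lambda = all_data.groupby(lambda ts: ts.time()).mean()

# Both results have one row per distinct time of day.
print(len(by_property), len(by_lambda))
```

Both calls produce the same groups; they differ only in how the keys are built, which is where the timings diverge.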

With the current master, both methods are considerably faster:

In [1]: pd.version.version
Out[1]: '0.11.1.dev-06cd915'

In [5]: %timeit all_data.groupby(lambda x: x.time()).mean()
1 loops, best of 3: 215 ms per loop

In [6]: %timeit all_data.groupby(all_data.index.time).mean()
10 loops, best of 3: 113 ms per loop

So you can either update to the current master or wait for 0.11.1, which should be released this month.
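
If upgrading is not an option, one possible workaround (not part of the timings above, and not benchmarked here) is to group on an integer "minutes since midnight" key, which avoids building a `datetime.time` object per row. The sketch below assumes a recent pandas and NumPy, and assumes every timestamp falls exactly on a whole minute, as in the question's data; `minute_of_day` is an illustrative name.

```python
import numpy as np
import pandas as pd

time_ind = pd.date_range("2013-01-01", "2013-01-10", freq="1min")
all_data = pd.Series(np.random.randn(len(time_ind)), index=time_ind)

# Integer key in the range 0..1439, computed with vectorized index attributes.
minute_of_day = np.asarray(time_ind.hour * 60 + time_ind.minute)
time_mean = all_data.groupby(minute_of_day).mean()

# Optional: relabel the result with datetime.time values for readability.
# The first 1440 index entries cover one full day in minute order,
# which matches the sorted integer keys.
time_mean.index = time_ind[:1440].time
```

If the timestamps carried seconds, this key would bin rows by minute instead of by exact time, so it is not a drop-in replacement for `index.time` in that case.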

179

answered Sep 28 '22 07:09

bmu

Tags: python, datetime, time, pandas, group-by
