I have a large pandas DataFrame with the columns timestamp, name, and value:
index timestamp name value
0 1999-12-31 23:59:59.000107 A 16
1 1999-12-31 23:59:59.000385 B 12
2 1999-12-31 23:59:59.000404 C 25
3 1999-12-31 23:59:59.000704 B 15
4 1999-12-31 23:59:59.001281 A 300
5 1999-12-31 23:59:59.002211 C 20
6 1999-12-31 23:59:59.002367 C 3
I want to group by time buckets (say 20 ms or 20 minutes) and by name, and calculate the average value for each group.
What is the most efficient way to do this?
Grouping by multiple columns: you can group on several keys at once by passing a list of column names (or grouper objects) to groupby() instead of a single string.
Before grouping by time, make sure the timestamp column actually holds datetime values. Pandas has a built-in function, to_datetime(), that converts dates and times stored as strings into the datetime64 dtype; if timestamp is still a string-type object column, convert it first, because the time bucketing below relies on a datetime column or index.
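As a minimal sketch of that conversion, using a few rows shaped like the sample data in the question (the values are taken from it for illustration only):

import pandas as pd

# Build a small frame shaped like the question's data.
df = pd.DataFrame({
    'timestamp': ['1999-12-31 23:59:59.000107',
                  '1999-12-31 23:59:59.000385',
                  '1999-12-31 23:59:59.000404',
                  '1999-12-31 23:59:59.000704'],
    'name': ['A', 'B', 'C', 'B'],
    'value': [16, 12, 25, 15],
})

# Convert the string column to datetime64 so time-based grouping works.
df['timestamp'] = pd.to_datetime(df['timestamp'])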
You can use pd.Grouper for the time bucketing. In its simplest form it expects the timestamps on the index, so you could try something like:
df.set_index('timestamp').groupby([pd.Grouper(freq='20min'), 'name']).mean()
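If you would rather not move the timestamp column to the index, pd.Grouper also accepts a key argument naming the column to bucket on. Continuing from the df built above, a sketch with 20 ms buckets (which suits the microsecond-scale sample data; use freq='20min' for 20-minute buckets):

# Group into 20 ms time buckets per name without touching the index.
out = (
    df.groupby([pd.Grouper(key='timestamp', freq='20ms'), 'name'])['value']
      .mean()
      .reset_index()
)
print(out)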