python pandas resample count and sum

Tags: python, datetime, indexing, pandas

I have data by date and want to create a new dataframe by week with sum of sales and count of categories.

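#standard packages
import numpy as np
import pandas as pd

#visualization (%matplotlib is an IPython/Jupyter magic)
%matplotlib inline
import matplotlib.pyplot as plt

#load the raw data and keep the columns of interest
#(raw string so the backslashes in the Windows path are not read as escapes)
edf = pd.read_csv(r'C:\Users\j~\raw.csv', parse_dates=[6])
edf2 = edf[['DATESENT','Sales','Category']].copy()
edf2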

#output

DATESENT    |  SALES  | CATEGORY
2014-01-04      100        A
2014-01-05      150        B
2014-01-07      150        C
2014-01-10      175        D

#set a datetime index, then resample by week
edf2['DATESENT']=pd.to_datetime(edf2['DATESENT'],format='%m/%d/%Y')
edf2 = edf2.set_index('DATESENT')
edf2.resample('W').sum()

#output

            SALES CATEGORY 
DATESENT     
2014-01-05  250      AB
2014-01-12  325      CD

But I am looking for

           SALES CATEGORY 
DATESENT     
2014-01-05  250      2
2014-01-12  325      2

This attempt didn't work:

edf2 = e2.resample('W').agg("Category":len,"Sales":np.sum)

Thank you

asked Mar 21 '17 21:03

jeangelj

2 Answers

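agg accepts a dictionary that maps each column name to an aggregation, given either as a function or as the name of one. The attempt in the question fails because the mapping is not wrapped in braces.

edf2 = edf2.resample('W').agg({"Category":'size',"Sales":'sum'})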
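
For reference, here is a minimal self-contained sketch of this approach. It assumes the four sample rows from the question and the column names Sales and Category; the frame is rebuilt by hand rather than read from the CSV, and the name weekly is only illustrative.

```python
import pandas as pd

# Sample rows copied from the question, built by hand instead of read from the CSV.
edf2 = pd.DataFrame({
    "DATESENT": pd.to_datetime(
        ["2014-01-04", "2014-01-05", "2014-01-07", "2014-01-10"]
    ),
    "Sales": [100, 150, 150, 175],
    "Category": ["A", "B", "C", "D"],
})

# resample needs a datetime index; 'W' bins end on Sunday by default.
weekly = (
    edf2.set_index("DATESENT")
        .resample("W")
        .agg({"Sales": "sum", "Category": "size"})
)
print(weekly)
```

With these rows, the two weekly bins end on 2014-01-05 and 2014-01-12, and each holds two rows.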

answered Sep 19 '22 01:09

Scott Boston

Using pd.Grouper + agg (on older pandas versions this was pd.TimeGrouper('W'), which has since been removed):

f = {'SALES': 'sum', 'CATEGORY': 'count'}
g = pd.Grouper(freq='W')
df.set_index('DATESENT').groupby(g).agg(f)

            CATEGORY  SALES
DATESENT                   
2014-01-05         2    250
2014-01-12         2    325
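
One difference between the two answers is 'size' versus 'count'. The sketch below shows how they can diverge, using pd.Grouper(freq='W') for the weekly bins. The frame is an assumption for illustration: it repeats the question's sample rows but sets one CATEGORY to missing, which is not the case in the original data.

```python
import numpy as np
import pandas as pd

# Question's sample rows, with one CATEGORY set to NaN for illustration only.
df = pd.DataFrame({
    "DATESENT": pd.to_datetime(
        ["2014-01-04", "2014-01-05", "2014-01-07", "2014-01-10"]
    ),
    "SALES": [100, 150, 150, 175],
    "CATEGORY": ["A", np.nan, "C", "D"],
})

g = pd.Grouper(freq="W")  # weekly bins on the index, ending on Sunday
grouped = df.set_index("DATESENT").groupby(g)

# 'count' counts only the non-missing values in each bin.
counted = grouped.agg({"SALES": "sum", "CATEGORY": "count"})
# 'size' counts every row in each bin, missing or not.
sized = grouped.agg({"SALES": "sum", "CATEGORY": "size"})

print(counted)
print(sized)
```

If the category column never has missing values, as in the question's sample, both choices give the same counts.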

answered Sep 20 '22 01:09

piRSquared
