Finding the mean and standard deviation of a timedelta object in pandas df

Tags: python, datetime, pandas, timedelta, mean

I would like to calculate the mean and standard deviation of a timedelta by bank from a dataframe with the two columns shown below. When I run my code (also shown below), I get the following error:

pandas.core.base.DataError: No numeric types to aggregate

My dataframe:

bank                          diff
Bank of Japan                 0 days 00:00:57.416000
Reserve Bank of Australia     0 days 00:00:21.452000
Reserve Bank of New Zealand  55 days 12:39:32.269000
U.S. Federal Reserve          8 days 13:27:11.387000

My code:

means = dropped.groupby('bank').mean()
std = dropped.groupby('bank').std()

asked Jun 18 '17 15:06

Graham Streich

2 Answers

You need to convert the timedelta to a numeric value first, e.g. to int64 via values. This is the most accurate option, because nanoseconds are the underlying numeric representation of a timedelta:

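import numpy as np
import pandas as pd
# view the timedeltas as integer nanoseconds
dropped['new'] = dropped['diff'].values.astype(np.int64)
# aggregate the numeric column, then convert the result back to timedelta
means = dropped.groupby('bank').mean()
means['new'] = pd.to_timedelta(means['new'])
std = dropped.groupby('bank').std()
std['new'] = pd.to_timedelta(std['new'])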

Another solution is to convert the values to seconds with dt.total_seconds(), but that is less accurate:

dropped['new'] = dropped['diff'].dt.total_seconds()
means = dropped.groupby('bank').mean()
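For readers who want to try the nanosecond approach end to end, here is a minimal self-contained sketch. The sample data is hypothetical (two made-up observations per bank, so that a standard deviation is defined), and the names ns, mean_td and std_td are introduced only for illustration. The explicit cast to timedelta64[ns] is an extra precaution in this sketch, not part of the answer, so that the integers really are nanoseconds whatever resolution pandas stored.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data: two observations per bank so that std is defined.
dropped = pd.DataFrame({
    "bank": ["Bank of Japan", "Bank of Japan",
             "U.S. Federal Reserve", "U.S. Federal Reserve"],
    "diff": pd.to_timedelta(["00:00:10", "00:00:30", "1 days", "3 days"]),
})

# Force nanosecond resolution, then view the timedeltas as plain integers.
dropped["ns"] = (
    dropped["diff"].to_numpy().astype("timedelta64[ns]").astype(np.int64)
)

# Aggregate only the numeric column.
grouped = dropped.groupby("bank")["ns"]

# The aggregates are floats in nanoseconds; convert them back to timedeltas.
mean_td = pd.to_timedelta(grouped.mean())
std_td = pd.to_timedelta(grouped.std())  # sample standard deviation (ddof=1)

print(mean_td)
print(std_td)
```

Selecting the ns column before aggregating keeps the original timedelta column out of the aggregation, so the sketch does not rely on how a given pandas version treats non-numeric columns.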

answered Oct 20 '22 15:10

jezrael

Pandas mean() and other aggregation methods support the numeric_only=False parameter.

dropped.groupby('bank').mean(numeric_only=False)

Found here: Aggregations for Timedelta values in the Python DataFrame
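A small sketch of this approach on made-up data (the values below are hypothetical, not the asker's). The mean uses numeric_only=False as described above. For the standard deviation, the sketch falls back to calling Series.std() per group through agg. That fallback is an assumption of this example rather than something stated in the answer, chosen so the result does not depend on whether the grouped std() accepts timedeltas in the installed pandas version.

```python
import pandas as pd

# Hypothetical sample data: two observations per bank.
dropped = pd.DataFrame({
    "bank": ["Bank of Japan", "Bank of Japan",
             "U.S. Federal Reserve", "U.S. Federal Reserve"],
    "diff": pd.to_timedelta(["00:00:10", "00:00:30", "1 days", "3 days"]),
})

# Mean per bank, computed directly on the timedelta column.
means = dropped.groupby("bank").mean(numeric_only=False)

# Standard deviation per bank via Series.std() on each group.
std = dropped.groupby("bank")["diff"].agg(lambda s: s.std())

print(means)
print(std)
```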

answered Oct 20 '22 15:10

Alexander Usikov
