Calculate average of every n rows from a csv file

Tags:

python

pandas

I have a csv file that has 25000 rows. I want to put the average of every 30 rows in another csv file.

I've given an example with 9 rows as below and the new csv file has 3 rows (3, 1, 2):

|   H    |
 ========
|   1    |---\
|   3    |   |--->| 3 |
|   5    |---/
|  -1    |---\
|   3    |   |--->| 1 |
|   1    |---/
|   0    |---\
|   5    |   |--->| 2 |
|   1    |---/

What I did:

import numpy as np
import pandas as pd

m_path = "file.csv"

m_df = pd.read_csv(m_path, usecols=['Col-01']) 
m_arr =  np.array([])
temp = m_df.to_numpy()
step = 30
for i in range(1, 25000, step):
    arr = np.append(m_arr,np.array([np.average(temp[i:i + step])]))

data = np.array(m_arr)[np.newaxis]

m_df = pd.DataFrame({'Column1': data[0, :]})
m_df.to_csv('AVG.csv')

This works well but Is there any other option to do this?

809

asked Mar 25 '20 14:03

Saeed

2 Answers

You can use integer division by step for consecutive groups and pass to groupby for aggregate mean:

step = 30
m_df = pd.read_csv(m_path, usecols=['Col-01']) 
df = m_df.groupby(m_df.index // step).mean()

Or:

df = m_df.groupby(np.arange(len(dfm_df// step).mean()

Sample data:

step = 3
df = m_df.groupby(m_df.index // step).mean()
print (df)
   H
0  3
1  1
2  2

189

answered Sep 17 '22 01:09

jezrael

You can get rolling mean using DataFrame.rolling and then filter result using slicing

df.rolling(3).mean()[2::3].reset_index(drop=True)
     a
0  3.0
1  1.0
2  2.0

answered Sep 18 '22 01:09

Dishin H Goyani

Related questions
                            
                                pandas groupby transform custom function
                            
                                How can I use git repos as dependencies for my PyPi package?
                            
                                Pandas GroupBy and Calculate Z-Score [duplicate]
                            
                                Trouble modifying the language option in selenium python bindings
                            
                                Unable to solve "ImportError: dynamic module does not define module export function"
                            
                                How do I correctly set MYPYPATH to pick up stubs for mypy?
                            
                                pytorch embedding index out of range
                            
                                How to resolve inconsistent package warnings in conda?
                            
                                Make Python script combined with linux packages easy installable for end-user
                            
                                How do I see the time it took to run my program in Visual Studio Code?
                            
                                Non-overlapping rolling windows in pandas dataframes
                            
                                How to efficiently use CountVectorizer to get ngram counts for all files in a directory combined?
                            
                                Implementing PCA with Numpy
                            
                                How to solve an error that appears in conda proxy configuration?
                            
                                Having trouble reading AWS config file with python configparser
                            
                                PyTorch RuntimeError: DataLoader worker (pid(s) 15332) exited unexpectedly
                            
                                How to properly use asyncio run_coroutine_threadsafe function?
                            
                                How to read from a high IO dataset in pytorch which grows from epoch to epoch
                            
                                How to put a label on a country with Python cartopy?
                            
                                Where to put .dockerignore?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With