Finding moving average from data points in Python

Tags: python, plot, sum, average

I am playing in Python a bit again, and I found a neat book with examples. One of the examples is to plot some data. I have a .txt file with two columns and I have the data. I plotted the data just fine, but in the exercise it says: Modify your program further to calculate and plot the running average of the data, defined by:

$Y_k=\frac{1}{2r}\sum_{m=-r}^r y_{k+m}$

where r=5 in this case (and the y_k is the second column in the data file). Have the program plot both the original data and the running average on the same graph.
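For reference, the formula can be read literally as a loop over every index k that has r points on each side. The sketch below is only an illustration: the function name `running_average` and the `divisor` parameter are made up here. Note that the sum runs over 2r+1 terms while the formula as quoted divides by 2r; the sketch divides by the number of terms by default (an assumption, so that a constant series averages to itself), and `divisor=2*r` follows the quoted formula to the letter.

```python
import numpy as np

def running_average(y, r, divisor=None):
    """Sum y[k-r] .. y[k+r] for every k where the full window fits.

    Hypothetical helper: divisor defaults to the number of terms (2r+1);
    pass divisor=2*r to follow the formula exactly as quoted.
    """
    y = np.asarray(y, dtype=float)
    r = int(r)
    if divisor is None:
        divisor = 2 * r + 1
    ks = np.arange(r, len(y) - r)  # indices with r neighbours on each side
    Y = np.array([y[k - r:k + r + 1].sum() / divisor for k in ks])
    return ks, Y

ks, Y = running_average([1, 2, 3, 4, 5], r=1)
# ks = [1, 2, 3], Y = [2.0, 3.0, 4.0]
```

The first and last r points get no average because the window does not fit there, so `Y` is plotted against `x[ks]` rather than against all of `x`.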

So far I have this:

    from pylab import plot, ylim, xlim, show, xlabel, ylabel
    from numpy import linspace, loadtxt

    data = loadtxt("sunspots.txt", float)
    r=5.0

    x = data[:,0]
    y = data[:,1]

    plot(x,y)
    xlim(0,1000)
    xlabel("Months since Jan 1749.")
    ylabel("No. of Sun spots")
    show()

So how do I calculate the sum? In Mathematica it's simple since it's symbolic manipulation (Sum[i, {i,0,10}] for example), but how do I calculate a sum in Python that takes every ten points in the data, averages them, and does so until the end of the points?

I looked at the book, but found nothing that would explain this :\

heltonbiker's code did the trick ^^ :D

    from __future__ import division
    from pylab import plot, ylim, xlim, show, xlabel, ylabel, grid
    from numpy import linspace, loadtxt, ones, convolve
    import numpy as numpy

    data = loadtxt("sunspots.txt", float)

    def movingaverage(interval, window_size):
        window= numpy.ones(int(window_size))/float(window_size)
        return numpy.convolve(interval, window, 'same')

    x = data[:,0]
    y = data[:,1]

    plot(x,y,"k.")
    y_av = movingaverage(y, 10)
    plot(x, y_av,"r")
    xlim(0,1000)
    xlabel("Months since Jan 1749.")
    ylabel("No. of Sun spots")
    grid(True)
    show()

And I got this:

Thank you very much ^^ :)

759

asked Jul 05 '12 20:07

dingo_d

2 Answers

As numpy.convolve is pretty slow, those who need a fast performing solution might prefer an easier to understand cumsum approach. Here is the code:

    cumsum_vec = numpy.cumsum(numpy.insert(data, 0, 0))
    ma_vec = (cumsum_vec[window_width:] - cumsum_vec[:-window_width]) / window_width

where data contains your data, and ma_vec will contain moving averages of window_width length.

On average, cumsum is about 30-40 times faster than convolve.
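As a small sanity check, the cumsum expression can be wrapped in a function and compared against numpy.convolve on a toy array. This is a sketch only: the name `moving_average_cumsum` and the sample values are made up for illustration. It shows that the result contains only full windows, so it is shorter than the input by window_width - 1, like convolve with mode 'valid' and unlike mode 'same'.

```python
import numpy as np

def moving_average_cumsum(data, window_width):
    # Prepend a zero so that cumsum_vec[i] is the sum of the first i samples.
    cumsum_vec = np.cumsum(np.insert(np.asarray(data, dtype=float), 0, 0))
    # The difference of two cumulative sums is the sum over one window.
    return (cumsum_vec[window_width:] - cumsum_vec[:-window_width]) / window_width

data = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
ma_vec = moving_average_cumsum(data, 3)
# ma_vec = [2.0, 3.0, 4.0, 5.0]

# Same values as a box-window convolution restricted to full windows.
ref = np.convolve(data, np.ones(3) / 3, mode="valid")
```

Because every output value is the difference of two running totals, rounding error can build up on very long or very large-valued series, so it may be worth comparing against convolve once on your own data.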

103

answered Sep 24 '22 10:09

Roman Kh

Before reading this answer, bear in mind that there is another answer, from Roman Kh, which uses numpy.cumsum and is much faster than this one.

~~Best~~ One common way to apply a moving/sliding average (or any other sliding window function) to a signal is by using numpy.convolve().

    def movingaverage(interval, window_size):
        window = numpy.ones(int(window_size))/float(window_size)
        return numpy.convolve(interval, window, 'same')

Here, interval is your y array (the data values to be smoothed), and window_size is the number of samples to consider. The window will be centered on each sample, so it takes samples before and after the current sample in order to calculate the average. Your code would become:

    plot(x,y)
    xlim(0,1000)

    # smooth the y values; the x values stay the same
    y_av = movingaverage(y, r)
    plot(x, y_av)

    xlabel("Months since Jan 1749.")
    ylabel("No. of Sun spots")
    show()
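One thing to watch for with mode 'same' is the behavior at the two ends of the data: there the window hangs over the edge, and numpy.convolve treats the missing samples as zeros. A minimal sketch with a constant toy series (values chosen only for illustration) makes this visible, and shows mode 'valid' as one possible alternative that keeps only full windows:

```python
import numpy as np

y = np.full(5, 3.0)      # constant series: every true average is 3.0
window = np.ones(3) / 3  # box window of 3 samples

same = np.convolve(y, window, mode="same")
# same = [2.0, 3.0, 3.0, 3.0, 2.0]; the ends are pulled toward zero

valid = np.convolve(y, window, mode="valid")
# valid = [3.0, 3.0, 3.0]; only positions where the window fits entirely

# With 'valid', the matching x values are the interior ones.
x = np.arange(5)
x_valid = x[1:-1]
```

Whether the dip at the edges matters depends on the plot; for a long series such as the sunspot data it affects only the first and last few points.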

Hope this helps!

answered Sep 26 '22 10:09

heltonbiker
