How to plot a time series array, with confidence intervals displayed, in python?

Tags:

I have some time series which slowly increases, but over a short period of time they are very wavy. For example, the time series could look like:

[10 + np.random.rand() for i in range(100)] + [12 + np.random.rand() for i in range(100)] + [14 + np.random.rand() for i in range(100)]

I would like to plot the time series with a focus on the general trend, not on the small waves. Is there a way to plot the mean over a period of time surrounded with a stripe indicating the waves (the stripe should represent the confidence interval, where the data point could be in that moment)?

A simple plot would look like this:

enter image description here

The plot which I would like, with confidence intervals would look like this:

enter image description here

Is there an elegant way to do it in Python?

871

asked May 03 '18 17:05

Ștefan

2 Answers

You could use pandas function rolling(n) to generate the mean and standard deviation values over n consecutive points.

For the shade of the confidence intervals (represented by the space between standard deviations) you can use the function fill_between() from matplotlib.pyplot. For more information you could take a look over here, from which the following code is inspired.

import numpy             as np
import pandas            as pd
import matplotlib.pyplot as plt

#Declare the array containing the series you want to plot. 
#For example:
time_series_array = np.sin(np.linspace(-np.pi, np.pi, 400)) + np.random.rand((400))
n_steps           = 15 #number of rolling steps for the mean/std.

#Compute curves of interest:
time_series_df = pd.DataFrame(time_series_array)
smooth_path    = time_series_df.rolling(n_steps).mean()
path_deviation = 2 * time_series_df.rolling(n_steps).std()

under_line     = (smooth_path-path_deviation)[0]
over_line      = (smooth_path+path_deviation)[0]

#Plotting:
plt.plot(smooth_path, linewidth=2) #mean curve.
plt.fill_between(path_deviation.index, under_line, over_line, color='b', alpha=.1) #std curves.

With the above code you obtain something like this: enter image description here

answered Sep 17 '22 09:09

Ștefan

Looks like, you're doubling the std twice. I guess it should be like this:

time_series_df = pd.DataFrame(time_series_array)
smooth_path = time_series_df.rolling(20).mean()
path_deviation = time_series_df.rolling(20).std()
plt.plot(smooth_path, linewidth=2)
plt.fill_between(path_deviation.index, (smooth_path-2*path_deviation)[0], (smooth_path+2*path_deviation)[0], color='b', alpha=.1)

answered Sep 19 '22 09:09

flrndttrch

Related questions
                            
                                Can pandas read a transposed CSV?
                            
                                How to select last row and also how to access PySpark dataframe by index?
                            
                                Determining Pandas Column DataType
                            
                                Django breaking long lookup names on queries
                            
                                Django queryset annotate field to be a list/queryset
                            
                                How to add black border to matplotlib 2.0 `ax` object In Python 3?
                            
                                How to connect broken lines in a binary image using Python/Opencv
                            
                                Keras input_shape for conv2d and manually loaded images
                            
                                Matplotlib: 3D surface plot turn off background but keep axes
                            
                                TypeError: unhashable type: 'list' when use groupby in python
                            
                                How to split a string into command line arguments like the shell in python?
                            
                                What is "Pure Python?"
                            
                                Concat two arrays of different dimensions numpy
                            
                                py.test deal with both pylint and flake8 when importing features from a module
                            
                                Mouse scroll wheel with selenium webdriver, on element without scrollbar?
                            
                                Can I put a class definition into __init__.py?
                            
                                Using sample_weight in Keras for sequence labelling
                            
                                How to use a Keras RNN model to forecast for future dates or events?
                            
                                How to upload file to google drive with service account credential
                            
                                tweepy get tweets between two dates

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to plot a time series array, with confidence intervals displayed, in python?

Tags:

python

matplotlib

plot

seaborn

confidence-interval

Ștefan

People also ask

2 Answers

Ștefan

flrndttrch

Recent Activity

Donate For Us