How to compute and plot a LOWESS curve in Python?

Tags:

How can I find and plot a LOWESS curve that looks like the following using Python?

loess curve example

I'm aware of the LOWESS implementation in statsmodels, but it doesn't seem to be able to give me 95% confidence interval lines that I can shade between. Seaborn has a method that calls the statsmodels implementation, but it can't plot the confidence intervals.

Other StackOverflow answers give code to draw a LOESS/LOWESS line, but none with a confidence interval. Can anyone assist with this? Is anyone aware of an existing implementation that would enable me to do this?

Thanks in advance.

845

asked Mar 06 '17 23:03

Suriname0

1 Answers

I found a link here is useful, and I put code below:

def lowess(x, y, f=1./3.):
    """
    Basic LOWESS smoother with uncertainty. 
    Note:
        - Not robust (so no iteration) and
             only normally distributed errors. 
        - No higher order polynomials d=1 
            so linear smoother.
    """
    # get some paras
    xwidth = f*(x.max()-x.min()) # effective width after reduction factor
    N = len(x) # number of obs
    # Don't assume the data is sorted
    order = np.argsort(x)
    # storage
    y_sm = np.zeros_like(y)
    y_stderr = np.zeros_like(y)
    # define the weigthing function -- clipping too!
    tricube = lambda d : np.clip((1- np.abs(d)**3)**3, 0, 1)
    # run the regression for each observation i
    for i in range(N):
        dist = np.abs((x[order][i]-x[order]))/xwidth
        w = tricube(dist)
        # form linear system with the weights
        A = np.stack([w, x[order]*w]).T
        b = w * y[order]
        ATA = A.T.dot(A)
        ATb = A.T.dot(b)
        # solve the syste
        sol = np.linalg.solve(ATA, ATb)
        # predict for the observation only
        yest = A[i].dot(sol)# equiv of A.dot(yest) just for k
        place = order[i]
        y_sm[place]=yest 
        sigma2 = (np.sum((A.dot(sol) -y [order])**2)/N )
        # Calculate the standard error
        y_stderr[place] = np.sqrt(sigma2 * 
                                A[i].dot(np.linalg.inv(ATA)
                                                    ).dot(A[i]))
    return y_sm, y_stderr


import numpy as np
import matplotlib.pyplot as plt


# make some data
x = 5*np.random.random(100)
y = np.sin(x) * 3*np.exp(-x) + np.random.normal(0, 0.2, 100)
order = np.argsort(x)

#run it
y_sm, y_std = lowess(x, y, f=1./5.)
# plot it
plt.plot(x[order], y_sm[order], color='tomato', label='LOWESS')
plt.fill_between(x[order], y_sm[order] - 1.96*y_std[order],
                 y_sm[order] + 1.96*y_std[order], alpha=0.3, label='LOWESS uncertainty')
plt.plot(x, y, 'k.', label='Observations')
plt.legend(loc='best')
#run it
y_sm, y_std = lowess(x, y, f=1./5.)
# plot it
plt.plot(x[order], y_sm[order], color='tomato', label='LOWESS')
plt.fill_between(x[order], y_sm[order] - y_std[order],
                 y_sm[order] + y_std[order], alpha=0.3, label='LOWESS uncertainty')
plt.plot(x, y, 'k.', label='Observations')
plt.legend(loc='best')

enter image description here

answered Oct 04 '22 03:10

ted930511

Related questions
                            
                                Updating an old system to Q-learning with Neural Networks
                            
                                Interactive debugging in IPython (Jupyter) notebook
                            
                                Python Pandas Series failure datetime
                            
                                How to Mock Import for Read the Docs?
                            
                                Adding element to a dictionary in python?
                            
                                Call another setup.py in setup.py
                            
                                Python 2.7 Cx_Freeze: ImportError: No module named __startup__
                            
                                How to include the default TEMPLATE_CONTEXT_PROCESSORS in the new TEMPLATES setting in Django 1.10
                            
                                imageio: How to increase quality of output gifs?
                            
                                python script crashes after long time running
                            
                                Tensorflow training becomes slower and slower when iteration is more than 10,000. Why?
                            
                                Pandas: Write CSV file with Windows line ending
                            
                                Paragraph Segmentation using Machine Learning
                            
                                How to decompress mongo journal files
                            
                                Pytest not showing test results in the terminal window
                            
                                Why are Python lists implemented as dynamic arrays instead of ring buffers?
                            
                                OpenCV background subtraction learning rate cannot change
                            
                                Custom RMSPE loss function in keras
                            
                                Natural sort on Django Queryset
                            
                                PyCharm not displaying Google style docstrings in tooltips

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to compute and plot a LOWESS curve in Python?

Tags:

python

matplotlib

plot

statistics

statsmodels

Suriname0

People also ask

1 Answers

ted930511

Recent Activity

Donate For Us