I implemented, for training purposes, a linear regression in Python. The problem is that the cost is increasing instead of decreasing. For the data I use the Airfoil Self-Noise Data Set; the data can be found here.
I import the data as follows:
import pandas as pd

def features():
    features = pd.read_csv("data/airfoil_self_noise/airfoil_self_noise.dat.txt", sep="\t", header=None)
    X = features.iloc[:, 0:5]
    Y = features.iloc[:, 5]
    return X.values, Y.values.reshape(Y.shape[0], 1)
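As a quick sanity check (assuming the file matches the UCI version of the data set, which has 1503 rows with 5 input columns and 1 target column), the shapes should come out as:

X, Y = features()
print(X.shape)  # expected: (1503, 5)
print(Y.shape)  # expected: (1503, 1)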
My code for the linear regression is the following:
import numpy as np
import random

class linearRegression():
    def __init__(self, learning_rate=0.01, max_iter=20):
        """
        Initialize the hyperparameters of the linear regression.
        :param learning_rate: the learning rate
        :param max_iter: the max number of iterations to perform
        """
        self.lr = learning_rate
        self.max_iter = max_iter
        self.m = None
        self.weights = None
        self.bias = None

    def fit(self, X, Y):
        """
        Run the gradient descent algorithm.
        :param X: the inputs
        :param Y: the outputs
        """
        self.m = X.shape[0]
        self.weights = np.random.normal(0, 0.1, (X.shape[1], 1))
        self.bias = random.normalvariate(0, 0.1)
        for iteration in range(self.max_iter):
            A = self.__forward(X)
            dw, db = self.__backward(A, X, Y)
            J = (1 / (2 * self.m)) * np.sum(np.power((A - Y), 2))
            print("at iteration %s cost is %s" % (iteration, J))
            self.weights = self.weights - self.lr * dw
            self.bias = self.bias - self.lr * db

    def predict(self, X):
        """
        Make predictions on the inputs.
        :param X: the inputs
        :return: the predicted outputs
        """
        Y_pred = self.__forward(X)
        return Y_pred

    def __forward(self, X):
        """
        Compute the linear function on the inputs.
        :param X: the inputs
        :return:
            A: the activation
        """
        A = np.dot(X, self.weights) + self.bias
        return A

    def __backward(self, A, X, Y):
        """
        Compute the gradients of the cost with respect to the parameters.
        :param A: the activation
        :param X: the inputs
        :param Y: the outputs
        :return:
            dw: the gradient for the weights
            db: the gradient for the bias
        """
        dw = (1 / self.m) * np.dot(X.T, (A - Y))
        db = (1 / self.m) * np.sum(A - Y)
        return dw, db
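For completeness, I believe the gradients in __backward match the partial derivatives of the cost J computed in fit, so the gradient code itself should be correct:

$$\frac{\partial J}{\partial w} = \frac{1}{m} X^\top (A - Y), \qquad \frac{\partial J}{\partial b} = \frac{1}{m} \sum_{i=1}^{m} (A_i - Y_i)$$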
Then I instantiate the linearRegression class as follows:
from sklearn.model_selection import train_test_split

X, Y = features()
X_train, X_test, y_train, y_test = train_test_split(X, Y, test_size=0.33, random_state=42)

model = linearRegression()
model.fit(X_train, y_train)
I have tried to find out why the cost is increasing, but so far I have not been able to. If someone could point me in the right direction, it would be appreciated.
For a linear regression model, the cost is computed from the errors between the predicted values and the actual values, and training searches for the parameters that minimize it. The usual choice of cost function for linear regression is the mean squared error (MSE) or its square root, the root mean squared error (RMSE).
A useful distinction: the loss function is the error for an individual data point (one training example), while the cost function is the average of the losses over the n samples in the training data.
Mean squared error is the average of the squared differences between the predictions and the true values, and its output is a single number representing the cost. So the line with the minimum cost, i.e. the minimum MSE, represents the relationship between X and Y in the best possible manner.
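Concretely, in the notation of the code above, the cost minimized by gradient descent is the following (the extra factor of 1/2 is a common convention that cancels when taking the derivative):

$$J(w, b) = \frac{1}{2m} \sum_{i=1}^{m} \left( \hat{y}^{(i)} - y^{(i)} \right)^2$$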
In the machine learning world, linear regression is a parametric regression model that makes a prediction by taking a weighted sum of the input features of an observation (or data point) and adding a constant called the bias term.
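As a minimal illustration (the numbers are made up, not taken from the data set), the prediction for a single observation is just a dot product plus the bias:

import numpy as np

x = np.array([3.0, 1.0, 2.0])   # one observation with 3 features (made up)
w = np.array([0.5, -1.0, 2.0])  # learned weights (made up)
b = 0.25                        # learned bias (made up)

y_hat = np.dot(x, w) + b        # weighted sum of the features plus the bias
print(y_hat)                    # 3*0.5 - 1*1.0 + 2*2.0 + 0.25 = 4.75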
Normally, if you choose too large a learning rate you get exactly this kind of problem: each gradient step overshoots the minimum and the cost diverges. I examined your code, and my main observation is:
Your learning rate is much too high for this data. The features of the airfoil data set are on very different scales (the first column, frequency in Hz, reaches values in the thousands), which makes the gradients large. When I run your code unmodified except for a learning rate of 1e-7 instead of 0.01, I get reliably decreasing costs.
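Here is a minimal sketch of the suggested fix, reusing the features() loader and the linearRegression class from the question (the file path and train/test split mirror the question; adjust them to your setup):

from sklearn.model_selection import train_test_split

X, Y = features()
X_train, X_test, y_train, y_test = train_test_split(X, Y, test_size=0.33, random_state=42)

# Much smaller step size: with unscaled features, 0.01 makes every
# update overshoot, so the cost grows instead of shrinking.
model = linearRegression(learning_rate=1e-7, max_iter=20)
model.fit(X_train, y_train)  # the printed cost should now decrease

An alternative worth trying (not tested here) is to standardize the features first, for example with sklearn.preprocessing.StandardScaler, after which a larger learning rate such as 0.01 typically converges.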