How to get the slope of a linear regression line using C++?

Tags: c++, math, linear-regression

I need to obtain the slope of a linear regression, similar to the way the Excel SLOPE function in the link below is implemented:

http://office.microsoft.com/en-gb/excel-help/slope-function-HP010342903.aspx

Is there a library in C++ or a simple coded solution someone has created which can do this?

I have implemented code according to the formula below (taken from http://easycalculation.com/statistics/learn-regression.php), but it does not always give me the correct results:

Slope(b) = (NΣXY - (ΣX)(ΣY)) / (NΣX² - (ΣX)²)
         = ((5)*(1159.7) - (311)*(18.6)) / ((5)*(19359) - (311)²)
         = (5798.5 - 5784.6) / (96795 - 96721)
         = 13.9 / 74
         = 0.19

If I try it against the following vectors, I get the wrong result (I expect 0.305556):

x = 6, 5, 11, 7, 5, 4, 4
y = 2, 3, 9, 1, 8, 7, 5

Thanks in advance.


asked Sep 22 '13 03:09

godzilla

3 Answers

Here is a C++11 implementation:

#include <algorithm>
#include <iostream>
#include <numeric>
#include <vector>

double slope(const std::vector<double>& x, const std::vector<double>& y) {
    const auto n    = x.size();
    const auto s_x  = std::accumulate(x.begin(), x.end(), 0.0);
    const auto s_y  = std::accumulate(y.begin(), y.end(), 0.0);
    const auto s_xx = std::inner_product(x.begin(), x.end(), x.begin(), 0.0);
    const auto s_xy = std::inner_product(x.begin(), x.end(), y.begin(), 0.0);
    const auto a    = (n * s_xy - s_x * s_y) / (n * s_xx - s_x * s_x);
    return a;
}

int main() {
    std::vector<double> x{6, 5, 11, 7, 5, 4, 4};
    std::vector<double> y{2, 3, 9, 1, 8, 7, 5};
    std::cout << slope(x, y) << '\n';  // outputs 0.305556
}

You can add a test for the mathematical requirements (x.size() == y.size() and x is not constant) or, as the code above, assume that the user will take care of that.

answered Oct 23 '22 15:10

Cassio Neri

Why not just write some simple code like this? (It is not the best solution, for sure, just an example based on the help article.)

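#include <cstddef>
#include <numeric>
#include <stdexcept>
#include <vector>

double slope(const std::vector<double>& x, const std::vector<double>& y){
    if(x.size() != y.size()){
        // std::exception has no string constructor in standard C++,
        // so a standard exception type that accepts a message is used.
        throw std::invalid_argument("...");
    }
    const std::size_t n = x.size();

    // Means of x and y (note the 0.0 initial value, see below).
    double avgX = std::accumulate(x.begin(), x.end(), 0.0) / n;
    double avgY = std::accumulate(y.begin(), y.end(), 0.0) / n;

    double numerator = 0.0;
    double denominator = 0.0;

    // Sum of cross-deviations over sum of squared x deviations.
    for(std::size_t i=0; i<n; ++i){
        numerator += (x[i] - avgX) * (y[i] - avgY);
        denominator += (x[i] - avgX) * (x[i] - avgX);
    }

    // x is constant (or empty): the slope is undefined.
    if(denominator == 0.0){
        throw std::invalid_argument("...");
    }

    return numerator / denominator;
}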

Note that the third argument of std::accumulate must be 0.0 rather than 0. Otherwise the compiler deduces the accumulator type as int, and the results of the accumulate calls will most likely be wrong (they are actually wrong with MSVC2010 and mingw-w64 when passing 0 as the third parameter).
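
To make the difference concrete, here is a small sketch comparing the two initial values. The function names sum_with_int_init and sum_with_double_init are made up for this illustration.

```cpp
#include <numeric>
#include <vector>

// With an int initial value the running total is an int, so every
// partial sum is truncated when it is stored back into the accumulator.
inline double sum_with_int_init(const std::vector<double>& v) {
    return std::accumulate(v.begin(), v.end(), 0);    // accumulator type: int
}

// With a double initial value the running total keeps its fractional part.
inline double sum_with_double_init(const std::vector<double>& v) {
    return std::accumulate(v.begin(), v.end(), 0.0);  // accumulator type: double
}
```

For the input {0.5, 0.5, 0.5} the first function returns 0, because each partial sum 0 + 0.5 is truncated back to 0, while the second returns 1.5.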

answered Oct 23 '22 15:10

Qué Padre

The following is a templated function I use for linear regression (fitting). It takes the data as a std::vector:

#include <cstddef>
#include <vector>

// Fits y = slope * x + intercept, using the sample index 0, 1, 2, ... as x.
template <typename T>
std::vector<T> GetLinearFit(const std::vector<T>& data)
{
    T xSum = 0, ySum = 0, xxSum = 0, xySum = 0, slope, intercept;
    // Convert the size once to avoid mixing unsigned and T arithmetic.
    const T n = static_cast<T>(data.size());
    for (std::size_t i = 0; i < data.size(); i++)
    {
        const T x = static_cast<T>(i);  // implicit x-coordinate
        xSum += x;
        ySum += data[i];
        xxSum += x * x;
        xySum += x * data[i];
    }
    slope = (n * xySum - xSum * ySum) / (n * xxSum - xSum * xSum);
    intercept = (ySum - slope * xSum) / n;
    std::vector<T> res;
    res.push_back(slope);
    res.push_back(intercept);
    return res;
}

The function returns a vector with the first element being the slope, and the second element being the intercept of your linear regression.

Example usage:

std::vector<double> myData;
myData.push_back(1);
myData.push_back(3);
myData.push_back(4);
myData.push_back(2);
myData.push_back(5);

std::vector<double> linearReg = GetLinearFit(myData);
double slope = linearReg[0];
double intercept = linearReg[1];

Notice that the function assumes the x values are simply the sample indices 0, 1, 2, ... (which is what I needed). You may change that in the function if you wish.

answered Oct 23 '22 13:10

The Quantum Physicist
