Recently I started learning sklearn, numpy and pandas, and I made a function for multivariate linear regression. I'm wondering: is it possible to do multivariate polynomial regression?
This is my code for multivariate polynomial regression; it shows this error:
in check_consistent_length " samples: %r" % [int(l) for l in lengths])
ValueError: Found input variables with inconsistent numbers of samples: [8, 3]
Do you know what the problem is?
import numpy as np
import pandas as pd
import xlrd
from sklearn import linear_model
from sklearn.preprocessing import PolynomialFeatures
from sklearn.model_selection import train_test_split

def polynomial_prediction_of_future_strenght(input_data, cement, blast_fur_slug, fly_ash,
                                              water, superpl, coarse_aggr, fine_aggr, days):
    variables = prediction_accuracy(input_data)[4]
    results = prediction_accuracy(input_data)[5]

    var_train, var_test, res_train, res_test = train_test_split(variables, results, test_size=0.3, random_state=4)

    Poly_Regression = PolynomialFeatures(degree=2)
    poly_var_train = Poly_Regression.fit_transform(var_train)
    poly_var_test = Poly_Regression.fit_transform(var_test)

    input_values = [cement, blast_fur_slug, fly_ash, water, superpl, coarse_aggr, fine_aggr, days]

    regression = linear_model.LinearRegression()
    model = regression.fit(poly_var_train, res_train)

    predicted_strenght = regression.predict([input_values])
    predicted_strenght = round(predicted_strenght[0], 2)

    score = model.score(poly_var_test, res_test)
    score = round(score * 100, 2)

    print(predicted_strenght, score)

a = polynomial_prediction_of_future_strenght(data_less_than_28days, 260.9, 100.5, 78.3, 200.6, 8.6, 864.5, 761.5, 28)
Multivariate polynomial regression was used to generate polynomial iterators for time series exhibiting autocorrelations. A stepwise technique was used to add and remove polynomial terms to ensure the model contained only those terms that produce a statistically significant contribution to the fit.
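That stepwise idea can be sketched roughly as follows. This is only a toy forward-selection pass (no removal step); the synthetic data, the 0.05 cut-off, the degree, and the use of statsmodels OLS p-values are my own illustrative assumptions, not details from the description above.

# Toy forward-stepwise selection of polynomial terms by p-value (assumptions noted above).
import numpy as np
import statsmodels.api as sm
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = 1.5 * X[:, 0] + 0.8 * X[:, 0] * X[:, 1] + rng.normal(scale=0.1, size=200)

poly = PolynomialFeatures(degree=2, include_bias=False)
terms = poly.fit_transform(X)
names = poly.get_feature_names_out(["x1", "x2"])   # needs sklearn >= 1.0

selected = []                                # indices of terms kept in the model
remaining = list(range(terms.shape[1]))
while remaining:
    pvals = []
    for j in remaining:
        cols = sm.add_constant(terms[:, selected + [j]])
        fit = sm.OLS(y, cols).fit()
        pvals.append(fit.pvalues[-1])        # p-value of the candidate term
    best = int(np.argmin(pvals))
    if pvals[best] > 0.05:                   # stop when no candidate is significant
        break
    selected.append(remaining.pop(best))

print("kept terms:", [names[j] for j in selected])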
Multiple linear regression is a machine learning setup where we provide multiple independent variables for a single dependent variable, whereas simple linear regression uses just one independent variable as input. You can transform your features into polynomial features with sklearn's PolynomialFeatures and then use those features in your linear regression model:
from sklearn.preprocessing import PolynomialFeatures
from sklearn import linear_model
from sklearn.model_selection import train_test_split

# expand the raw features into degree-2 polynomial terms
poly = PolynomialFeatures(degree=2)
poly_variables = poly.fit_transform(variables)

# split the transformed features, then fit the linear model on the training part only
poly_var_train, poly_var_test, res_train, res_test = train_test_split(poly_variables, results, test_size=0.3, random_state=4)

regression = linear_model.LinearRegression()
model = regression.fit(poly_var_train, res_train)

# evaluate on the held-out test set
score = model.score(poly_var_test, res_test)
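If you want to see what fit_transform actually produces, here is a tiny self-contained check: for degree 2 and two features a and b the columns are 1, a, b, a², ab, b² (the feature-name call needs sklearn >= 1.0):

import numpy as np
from sklearn.preprocessing import PolynomialFeatures

X = np.array([[2.0, 3.0]])             # one sample with two features a=2, b=3
poly = PolynomialFeatures(degree=2)
print(poly.fit_transform(X))           # [[1. 2. 3. 4. 6. 9.]] -> 1, a, b, a^2, ab, b^2
print(poly.get_feature_names_out())    # ['1' 'x0' 'x1' 'x0^2' 'x0 x1' 'x1^2']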
Also, in your code you were training your model on the entire dataset and only then splitting it into train and test sets. That means your model had already seen the test data during training. You need to split first, train the model only on the training data, and then compute the score on the test set. I have included these changes as well. :)
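Putting it together, here is a minimal sketch of how your function could look with these changes. It assumes, as in your code, that prediction_accuracy(input_data) is defined elsewhere and returns the feature matrix and target vector at positions 4 and 5, and that the target is 1-D; it also runs the new sample through the same PolynomialFeatures object before predicting.

from sklearn import linear_model
from sklearn.preprocessing import PolynomialFeatures
from sklearn.model_selection import train_test_split

def polynomial_prediction_of_future_strenght(input_data, cement, blast_fur_slug, fly_ash,
                                              water, superpl, coarse_aggr, fine_aggr, days):
    # features and targets as in the question (prediction_accuracy is assumed to be defined elsewhere)
    variables = prediction_accuracy(input_data)[4]
    results = prediction_accuracy(input_data)[5]

    # expand to degree-2 polynomial terms, then split into train and test
    poly = PolynomialFeatures(degree=2)
    poly_variables = poly.fit_transform(variables)
    poly_var_train, poly_var_test, res_train, res_test = train_test_split(
        poly_variables, results, test_size=0.3, random_state=4)

    regression = linear_model.LinearRegression()
    model = regression.fit(poly_var_train, res_train)

    # the new sample must go through the same polynomial transform as the training data
    input_values = [[cement, blast_fur_slug, fly_ash, water, superpl, coarse_aggr, fine_aggr, days]]
    predicted_strenght = round(model.predict(poly.transform(input_values))[0], 2)

    score = round(model.score(poly_var_test, res_test) * 100, 2)
    print(predicted_strenght, score)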