difference between LinearRegression and svm.SVR(kernel="linear")

Tags:

First there are questions on this forum very similar to this one but trust me none matches so no duplicating please.

I have encountered two methods of linear regression using scikit's sklearn and I am failing to understand the difference between the two, especially where in first code there's a method train_test_split() called while in the other one directly fit method is called.

I am studying with multiple resources and this single issue is very confusing to me.

First which uses SVR

X = np.array(df.drop(['label'], 1))

X = preprocessing.scale(X)

y = np.array(df['label'])

X_train, X_test, y_train, y_test = cross_validation.train_test_split(X, y, test_size=0.2)

clf = svm.SVR(kernel='linear')

clf.fit(X_train, y_train)

confidence = clf.score(X_test, y_test)

And second is this one

# Split the data into training/testing sets
diabetes_X_train = diabetes_X[:-20]
diabetes_X_test = diabetes_X[-20:]

# Split the targets into training/testing sets
diabetes_y_train = diabetes.target[:-20]
diabetes_y_test = diabetes.target[-20:]

# Create linear regression object
regr = linear_model.LinearRegression()

# Train the model using the training sets
regr.fit(diabetes_X_train, diabetes_y_train)

# Make predictions using the testing set
diabetes_y_pred = regr.predict(diabetes_X_test)

So my main focus is the difference between using svr(kernel="linear") and using LinearRegression()

817

asked Oct 27 '17 08:10

Dev_Man

1 Answers

cross_validation.train_test_split : Splits arrays or matrices into random train and test subsets.

In second code, splitting is not random.

svm.SVR: The Support Vector Regression (SVR) uses the same principles as the SVM for classification, with only a few minor differences. First of all, because output is a real number it becomes very difficult to predict the information at hand, which has infinite possibilities. In the case of regression, a margin of tolerance (epsilon) is set in approximation to the SVM which would have already requested from the problem. But besides this fact, there is also a more complicated reason, the algorithm is more complicated therefore to be taken in consideration. However, the main idea is always the same: to minimize error, individualizing the hyperplane which maximizes the margin, keeping in mind that part of the error is tolerated.

Linear Regression: In statistics, linear regression is a linear approach for modeling the relationship between a scalar dependent variable y and one or more explanatory variables (or independent variables) denoted X. The case of one explanatory variable is called simple linear regression.

Reference: https://cs.adelaide.edu.au/~chhshen/teaching/ML_SVR.pdf

176

answered Sep 24 '22 01:09

Tushar Gupta

Related questions
                            
                                Matlab: neural network time series prediction?
                            
                                Multivariate time series forecasting with 3 months dataset
                            
                                PyTorch: is there a definitive training loop similar to Keras' fit()?
                            
                                How to sample large database and implement K-means and K-nn in R?
                            
                                How to integrate Apache Spark with Spring MVC web application for interactive user sessions
                            
                                Machine learning project: split training/test sets before or after exploratory data analysis?
                            
                                Reinforcement learning in C# [closed]
                            
                                How do you actually apply a trained model?
                            
                                Auto-encoders with tied weights in Caffe
                            
                                Choosing random_state for sklearn algorithms
                            
                                Keras. ValueError: I/O operation on closed file
                            
                                Cross validation with grid search returns worse results than default
                            
                                Is it possible to add your own WordNet to a library?
                            
                                Supervised Motion Detection Library
                            
                                Assign new data point to cluster in kernel k-means (kernlab package in R)?
                            
                                How to obtain information gain from a scikit-learn DecisionTreeClassifier?
                            
                                Python's implementation of Mutual Information
                            
                                what's the use of transformer_weights in scikit-learn pipeline?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

difference between LinearRegression and svm.SVR(kernel="linear")

Tags:

machine-learning

python-3.5

scikit-learn

regression

sklearn-pandas

Dev_Man

People also ask

1 Answers

Tushar Gupta

Recent Activity

Donate For Us