Why does not GridSearchCV give best score ? - Scikit Learn

Tags:

I have a dataset with 158 rows and 10 columns. I try to build multiple linear regression model and try to predict future value.

I used GridSearchCV for tunning parameters.

Here is my GridSearchCV and Regression function :

def GridSearch(data):
    X_train, X_test, y_train, y_test = cross_validation.train_test_split(data, ground_truth_data, test_size=0.3, random_state = 0)
    
    parameters = {'fit_intercept':[True,False], 'normalize':[True,False], 'copy_X':[True, False]}
    
    model = linear_model.LinearRegression()
    
    grid = GridSearchCV(model,parameters)
    
    grid.fit(X_train, y_train)
    predictions = grid.predict(X_test)
    
    print "Grid best score: ", grid.best_score_
    print "Grid score function: ", grid.score(X_test,y_test)

Output of this code is :

Grid best score: 0.720298870251

Grid score function: 0.888263112299

My question is what is the difference between best_score_ and score function ?

How the score function can be better than the best_score function ?

Thanks in advance.

651

asked May 25 '15 16:05

Batuhan B

1 Answers

The best_score_ is the best score from the cross-validation. That is, the model is fit on part of the training data, and the score is computed by predicting the rest of the training data. This is because you passed X_train and y_train to fit; the fit process thus does not know anything about your test set, only your training set.

The score method of the model object scores the model on the data you give it. You passed X_test and y_test, so this call computes the score of the fit (i.e., tuned) model on the test set.

In short, the two scores are calculated on different data sets, so it shouldn't be surprising that they are different.

answered Oct 20 '22 00:10

BrenBarn

Related questions
                            
                                How do i monitor the progress of a file transfer through pysftp
                            
                                Using Selenium with PyCharm CE
                            
                                How to Combine pyWavelet and openCV for image processing?
                            
                                A fast way to find nonzero entries by row in a sparse matrix in Python
                            
                                Python recursive function to display all subsets of given set
                            
                                Convert seconds to minutes and seconds
                            
                                Tkinter Menu command targets function with arguments?
                            
                                Reproduce uuid from java code in python
                            
                                configparser without whitespace surrounding operator
                            
                                Unzip zip files in folders and subfolders
                            
                                Modify pandas dataframe values with numpy array
                            
                                How to check that mongo ObjectID is valid in python?
                            
                                Undo Overwrite of Python Built-In
                            
                                How to quantitatively measure goodness of fit in SciPy?
                            
                                How to iterate over `dict` in class like if just referring to `dict`?
                            
                                python: calculate center of mass
                            
                                Trouble installing pygame using pip install
                            
                                how to change the subject for Django error reporting emails?
                            
                                How to put two decimals in cell with type of percent
                            
                                Python: how to get values from a dictionary from pandas series

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why does not GridSearchCV give best score ? - Scikit Learn

Tags:

python

r

machine-learning

scikit-learn

regression

Batuhan B

People also ask

1 Answers

BrenBarn

Recent Activity

Donate For Us