Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to pass elegantly Sklearn's GridseachCV's best parameters to another model?

I have found a set of best hyperparameters for my KNN estimator with Grid Search CV:

>>> knn_gridsearch_model.best_params_
{'algorithm': 'auto', 'metric': 'manhattan', 'n_neighbors': 3}

So far, so good. I want to train my final estimator with these new-found parameters. Is there a way to feed the above hyperparameter dict to it directly? I tried this:

>>> new_knn_model = KNeighborsClassifier(knn_gridsearch_model.best_params_)

but instead the hoped result new_knn_model just got the whole dict as the first parameter of the model and left the remaining ones as default:

>>> knn_model
KNeighborsClassifier(algorithm='auto', leaf_size=30, metric='minkowski',
           metric_params=None, n_jobs=1,
           n_neighbors={'n_neighbors': 3, 'metric': 'manhattan', 'algorithm': 'auto'},
           p=2, weights='uniform')

Disappointing indeed.

like image 249
Hendrik Avatar asked Jul 13 '17 07:07

Hendrik


People also ask

How does cross-validation work in GridSearchCV?

Cross-Validation and GridSearchCV Cross-Validation is used while training the model. As we know that before training the model with data, we divide the data into two parts – train data and test data. In cross-validation, the process divides the train data further into two parts – the train data and the validation data.


2 Answers

You can do that as follows:

new_knn_model = KNeighborsClassifier()
new_knn_model.set_params(**knn_gridsearch_model.best_params_)

Or just unpack directly as @taras suggested:

new_knn_model = KNeighborsClassifier(**knn_gridsearch_model.best_params_)

By the way, after finish running the grid search, the grid search object actually keeps (by default) the best parameters, so you can use the object itself. Alternatively, you could also access the classifier with the best parameters through

gs.best_estimator_
like image 106
Miriam Farber Avatar answered Oct 13 '22 23:10

Miriam Farber


I just want to point out that using the grid.best_parameters and pass them to a new model by unpacking like:

my_model = KNeighborsClassifier(**grid.best_params_)

is good and all and I personally used it a lot.
However, as you can see in the documentation here, if your goal is to predict something using those best_parameters, you can directly use the grid.predict method which will use these best parameters for you by default.

example:

y_pred = grid.predict(X_test)

Hope this was helpful.

like image 34
Rayhane Mama Avatar answered Oct 14 '22 00:10

Rayhane Mama