I would like to use k-fold cross validation while learning a model. So far I am doing it like this:
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.naive_bayes import MultinomialNB

# splitting the dataset into training and test sets
X_train, X_test, y_train, y_test = train_test_split(dataset_1, df1['label'], test_size=0.25, random_state=4222)

# learning a model
model = MultinomialNB()
model.fit(X_train, y_train)

# 5-fold cross-validation on the training set
scores = cross_val_score(model, X_train, y_train, cv=5)
At this step I am not quite sure whether I should call model.fit() or not, because in the official scikit-learn documentation they do not fit the model but just call cross_val_score as follows (they do not even split the data into training and test sets):
from sklearn import datasets, svm
from sklearn.model_selection import cross_val_score

iris = datasets.load_iris()
clf = svm.SVC(kernel='linear', C=1)
scores = cross_val_score(clf, iris.data, iris.target, cv=5)
I would also like to tune the hyperparameters of the model while learning it. What is the right pipeline?
Cross-validation is repeated model fitting. Each fit is done on a large portion of the data and tested on the portion that was left out during fitting; this is repeated until every observation has been used for testing.
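As a rough illustration of that repeated fitting, here is a minimal sketch using KFold directly, assuming X_train and y_train from your split are NumPy arrays (with pandas objects you would index via .iloc):

from sklearn.model_selection import KFold
from sklearn.naive_bayes import MultinomialNB

kf = KFold(n_splits=5, shuffle=True, random_state=0)
fold_scores = []
for train_idx, test_idx in kf.split(X_train):
    fold_model = MultinomialNB()
    # fit on k-1 folds ...
    fold_model.fit(X_train[train_idx], y_train[train_idx])
    # ... and evaluate on the fold that was left out
    fold_scores.append(fold_model.score(X_train[test_idx], y_train[test_idx]))

This is essentially what cross_val_score(model, X_train, y_train, cv=5) does for you in one call (for classifiers and an integer cv it uses stratified folds by default).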
Multiple model comparison is also called cross-model validation. Here "model" refers to completely different algorithms: the idea is to build several models from the same training data and validate them on the same verification data in order to compare how the different models perform.
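For instance, a minimal sketch of comparing two different algorithms on the same training data (MultinomialNB and a linear SVC are just illustrative choices):

from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import MultinomialNB
from sklearn.svm import SVC

for name, candidate in [('MultinomialNB', MultinomialNB()),
                        ('linear SVC', SVC(kernel='linear', C=1))]:
    scores = cross_val_score(candidate, X_train, y_train, cv=5)
    print(name, scores.mean(), scores.std())

With an integer cv and no shuffling the folds are deterministic, so both candidates are scored on the same splits.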
The simplest way to use cross-validation is to call the cross_val_score helper function on the estimator and the dataset:

>>> from sklearn.model_selection import cross_val_score
>>> clf = svm.SVC(kernel='linear', C=1, random_state=42)
>>> scores = cross_val_score(clf, X, y, cv=5)
>>> scores
array([0.96..., 1. , ...])
Cross-validation is mainly used as a way to check for over-fitting. Assuming you have determined the optimal hyperparameters of your classification technique (let's assume random forest for now), you would then want to see whether the model generalizes well across different test sets.
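For the hyperparameter-tuning part of your question, one common approach is GridSearchCV, which runs the cross-validation for every parameter combination on the training data. Here is a minimal sketch for your MultinomialNB model; the alpha values are only placeholder candidates:

from sklearn.model_selection import GridSearchCV
from sklearn.naive_bayes import MultinomialNB

param_grid = {'alpha': [0.1, 0.5, 1.0]}  # placeholder candidate values
search = GridSearchCV(MultinomialNB(), param_grid, cv=5)
search.fit(X_train, y_train)            # cross-validation happens inside fit
print(search.best_params_, search.best_score_)
best_model = search.best_estimator_     # refit on the whole training set by default

The held-out test set from train_test_split is then used only once, at the very end, to estimate the performance of best_model.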
Your second example is right for doing the cross-validation. See the example here: http://scikit-learn.org/stable/modules/cross_validation.html#computing-cross-validated-metrics

The fitting is done inside the cross_val_score function; you don't need to worry about it beforehand.

[Edited] If, besides cross-validation, you also want a trained model, you can call model.fit() afterwards.
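Putting that together, a minimal sketch of the overall pipeline might look like this (cross_val_score clones the estimator internally, so the model object you pass in is left unfitted):

from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import MultinomialNB

model = MultinomialNB()

# cross-validated estimate of performance on the training data
scores = cross_val_score(model, X_train, y_train, cv=5)
print(scores.mean(), scores.std())

# train the final model on the full training set ...
model.fit(X_train, y_train)
# ... and evaluate it once on the held-out test set
print(model.score(X_test, y_test))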