sklearn's KFold function with shuffle and random_state

Question

I'm trying to understand how to use the cross-validation function sklearn.model_selection.KFold. If I define (like in this tutorial)

from sklearn.model_selection import KFold

kf = KFold(n_splits=5, shuffle=False, random_state=100)

I get

ValueError: Setting a random_state has no effect since shuffle is False.
You should leave random_state to its default (None), or set shuffle=True.

What does this error mean and why is it necessary to set random_state=None or shuffle=True?

Taylrl · Accepted Answer

Shuffling in this context means that the data is first randomly shuffled before splitting into test/train. The random_state will allow the way in which the data is shuffled to be repeatable. Without the shuffling switched on, the random_state has no meaning.

sklearn's KFold function with shuffle and random_state

Tags:

python

machine-learning

scikit-learn

k-fold

Medulla Oblongata

1 Answers

Taylrl

Recent Activity

Donate For Us

sklearn's KFold function with shuffle and random_state

Tags:

python

machine-learning

scikit-learn

k-fold

Medulla Oblongata

1 Answers

Taylrl

Related questions

Recent Activity

Donate For Us