Pandas and scikit-learn: KeyError: [....] not in index

Tags:

I do not understand why do I get the error KeyError: '[ 1351 1352 1353 ... 13500 13501 13502] not in index' when I run this code:

cv = KFold(n_splits=10)

for train_index, test_index in cv.split(X):
    f_train_X, f_valid_X = X[train_index], X[test_index]
    f_train_y, f_valid_y = y[train_index], y[test_index]

I use X (a Pandas dataframe) to split I cv.split(X).

X.shape
y.shape
Out: (13503, 17)
Out: (13503,)

318

asked Jun 28 '18 20:06

ScalaBoy

1 Answers

The problem is the way you are trying to index the X using X[train_index]. You need to use .loc or .iloc since you have pandas dataframe.

Use this

cv = KFold(n_splits=10)

for train_index, test_index in cv.split(X):
    f_train_X, f_valid_X = X.iloc[train_index], X.iloc[test_index]
    f_train_y, f_valid_y = y.iloc[train_index], y.iloc[test_index]

1st way: Example using `iloc`

import pandas as pd
import numpy as np

df = pd.DataFrame(np.random.randint(0,100,size=(100, 4)), columns=list('ABCD'))

df[[1,2]]
#KeyError: '[1 2] not in index'

df.iloc[[1,2]]
#    A   B   C   D
#1  25  97  78  74
#2   6  84  16  21

2nd way: Example by converting pandas to numpy in advance

df = df.values

#now this should work fine
df[[1,2]]
#array([[25, 97, 78, 74],
#      [ 6, 84, 16, 21]])

answered Oct 18 '22 09:10

seralouk

Related questions
                            
                                Django datetime field - convert to timezone in view
                            
                                python abstract property setter with concrete getter
                            
                                Asynchronous property setter
                            
                                copying one file's contents to another in python
                            
                                Pandas - unstack column values into new columns
                            
                                %USERPROFILE% env variable for python
                            
                                How can I print a euro (€) symbol in Python?
                            
                                Install tensorflow on Ubuntu 14.04
                            
                                How to set up a Selenium Python environment for Firefox
                            
                                Using alembic.config.main redirects log output
                            
                                How to convert bitarray to an integer in python
                            
                                Keras embedding layers: how do they work?
                            
                                Remove duplicate rows from Pandas dataframe where only some columns have the same value
                            
                                Datetime in pandas dataframe will not subtract from each other
                            
                                Exact field search in the Django admin
                            
                                Pylint: Disable Unnecessary "else" after "return" (no-else-return) warning
                            
                                Use Django ORM outside of Django
                            
                                "cannot create temp dir for user data dir" error when not running as admin
                            
                                Celery beat not picking up periodic tasks
                            
                                python cannot import timezone but can import datetime

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas and scikit-learn: KeyError: [....] not in index

Tags:

python

pandas

scikit-learn

ScalaBoy

People also ask

1 Answers

Use this

1st way: Example using `iloc`

2nd way: Example by converting pandas to numpy in advance

seralouk

Recent Activity

Donate For Us

Pandas and scikit-learn: KeyError: [....] not in index

Tags:

python

pandas

scikit-learn

ScalaBoy

People also ask

1 Answers

Use this

1st way: Example using iloc

2nd way: Example by converting pandas to numpy in advance

seralouk

Related questions

Recent Activity

Donate For Us

1st way: Example using `iloc`