I have the following code:

from sklearn.ensemble import ExtraTreesClassifier
from sklearn.cross_validation import cross_val_score

# split the dataset for train and test
combnum['is_train'] = np.random.uniform(0, 1, len(combnum)) <= .75
train, test = combnum[combnum['is_train']==True], combnum[combnum['is_train']==False]

et = ExtraTreesClassifier(n_estimators=200, max_depth=None,
                          min_samples_split=10, random_state=0)

labels = train[list(label_columns)].values
tlabels = test[list(label_columns)].values
features = train[list(columns)].values
tfeatures = test[list(columns)].values

et_score = cross_val_score(et, features, labels, n_jobs=-1)
print("{0} -> ET: {1})".format(label_columns, et_score))
Checking the shape of the arrays:
features.shape
Out[19]: (43069, 34)
And
labels.shape
Out[20]: (43069, 1)
and I'm getting:
IndexError: too many indices for array
and this relevant part of the traceback:
---> 22 et_score = cross_val_score(et, features, labels, n_jobs=-1)
I'm creating the data from Pandas DataFrames. I searched here and saw some references to possible errors with this method, but I can't figure out how to correct it. This is what the data arrays look like: features
Out[21]: array([[ 0., 1., 1., ..., 0., 0., 1.], [ 0., 1., 1., ..., 0., 0., 1.], [ 1., 1., 1., ..., 0., 0., 1.], ..., [ 0., 0., 1., ..., 0., 0., 1.], [ 0., 0., 1., ..., 0., 0., 1.], [ 0., 0., 1., ..., 0., 0., 1.]])
labels
Out[22]: array([[1], [1], [1], ..., [1], [1], [1]])
When we do cross-validation in scikit-learn, the process requires an (R,)-shaped label array instead of (R, 1). Although they are the same thing to some extent, their indexing mechanisms are different. So in your case, just add:
c, r = labels.shape
labels = labels.reshape(c,)
before passing it to the cross-validation function.
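As an illustrative sketch (using synthetic stand-ins, since `combnum` and the real column names aren't shown), the reshape, or equivalently `numpy.ravel`, turns the (R, 1) column vector extracted from the DataFrame into the (R,) shape that `cross_val_score` expects. Note the import below uses `sklearn.model_selection`, which replaced `sklearn.cross_validation` in newer scikit-learn versions:

```python
import numpy as np
from sklearn.ensemble import ExtraTreesClassifier
from sklearn.model_selection import cross_val_score  # sklearn.cross_validation in older versions

# synthetic stand-ins for the real features/labels arrays
features = np.random.rand(100, 34)
labels = np.random.randint(0, 2, size=(100, 1))  # (R, 1), as pulled from the DataFrame

print(labels.shape)      # (100, 1) -- triggers "too many indices" in older sklearn
labels = labels.ravel()  # same effect as labels.reshape(labels.shape[0],)
print(labels.shape)      # (100,)

et = ExtraTreesClassifier(n_estimators=10, random_state=0)
scores = cross_val_score(et, features, labels, n_jobs=-1)
print(scores)  # one accuracy score per fold
```

`ravel()` avoids unpacking the shape manually and reads the same either way; both produce a 1-D view of the labels.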