GridSearch for Multilabel OneVsRestClassifier?

Tags:

scikit-learn

I'm doing a grid search over multilabel data as follows:

#imports
from sklearn.svm import SVC as classifier
from sklearn.pipeline import Pipeline
from sklearn.decomposition import RandomizedPCA
from sklearn.cross_validation import StratifiedKFold
from sklearn.grid_search import GridSearchCV

#classifier pipeline
clf_pipeline = clf_pipeline = OneVsRestClassifier(
                Pipeline([('reduce_dim', RandomizedPCA()),
                          ('clf', classifier())
                          ]
                         ))

C_range = 10.0 ** np.arange(-2, 9)
gamma_range = 10.0 ** np.arange(-5, 4)
n_components_range = (10, 100, 200)
degree_range = (1, 2, 3, 4)

param_grid = dict(estimator__clf__gamma=gamma_range,
                  estimator__clf__c=c_range,
                  estimator__clf__degree=degree_range,
                  estimator__reduce_dim__n_components=n_components_range)

grid = GridSearchCV(clf_pipeline, param_grid,
                                cv=StratifiedKFold(y=Y, n_folds=3), n_jobs=1,
                                verbose=2)
grid.fit(X, Y)

I'm seeing the following traceback:

/Users/andrewwinterman/Documents/sparks-honey/classifier/lib/python2.7/site-packages/sklearn/grid_search.pyc in fit_grid_point(X, y, base_clf, clf_params, train, test, loss_func, score_func, verbose, **fit_params)
    107 
    108     if y is not None:
--> 109         y_test = y[safe_mask(y, test)]
    110         y_train = y[safe_mask(y, train)]
    111         clf.fit(X_train, y_train, **fit_params)

TypeError: only integer arrays with one element can be converted to an index

Looks like GridSearchCV objects to multiple labels. How should I work around this? Do I need to explicitly iterate through the unique classes with label_binarizer, running grid search on each sub-estimator?

320

asked Jan 08 '13 23:01

Maus

1 Answers

I think there is a bug in grid_search.py

Have you tried to give y as numpy array?

import numpy as np
Y = np.asarray(Y)

159

answered Oct 19 '22 05:10

Baskaya

Related questions
                            
                                Running Salome script without graphics
                            
                                Counting with scipy.sparse
                            
                                How to use nested transaction with scoped session in SQLAlchemy?
                            
                                Efficient way to check existence in a large set of strings
                            
                                How to create multidimensional array with numpy.mgrid
                            
                                Python minidom/xml : How to set node text with minidom api
                            
                                openpyxl please do not assume text as a number when importing
                            
                                Image Comparison for vector images (based on edge detection)?
                            
                                How to change the layout of a Gtk application on fullscreen?
                            
                                boost python method calls with reference arguments
                            
                                Boost.Python - Passing boost::python::object as argument to python function?
                            
                                How to import a module but ignoring the package's __init__.py?
                            
                                Get EXIF data without downloading whole image - Python
                            
                                Python: How can I get a list of function names from within __getattr__ function?
                            
                                How do I see if a domain uses DNSSEC
                            
                                Is there a python template library that can do "partial renderings"?
                            
                                OneToOneField with null=True doesn't allow empty field
                            
                                Adaptive plotting of a function in python
                            
                                Python permutations threads
                            
                                Global paster command not found in virtualenv

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With