Skip forbidden parameter combinations when using GridSearchCV

Tags:

I want to greedily search the entire parameter space of my support vector classifier using GridSearchCV. However, some combinations of parameters are forbidden by LinearSVC and throw an exception. In particular, there are mutually exclusive combinations of the dual, penalty, and loss parameters:

For example, this code:

Click to copy

from sklearn import svm, datasets
from sklearn.model_selection import GridSearchCV

iris = datasets.load_iris()
parameters = {'dual':[True, False], 'penalty' : ['l1', 'l2'], \
              'loss': ['hinge', 'squared_hinge']}
svc = svm.LinearSVC()
clf = GridSearchCV(svc, parameters)
clf.fit(iris.data, iris.target)

Returns ValueError: Unsupported set of arguments: The combination of penalty='l2' and loss='hinge' are not supported when dual=False, Parameters: penalty='l2', loss='hinge', dual=False

My question is: is it possible to make GridSearchCV skip combinations of parameters which the model forbids? If not, is there an easy way to construct a parameter space which won't violate the rules?

648

asked Mar 24 '17 21:03

crypdick

1 Answers

I solved this problem by passing error_score=0.0 to GridSearchCV:

error_score : ‘raise’ (default) or numeric

Value to assign to the score if an error occurs in estimator fitting. If set to ‘raise’, the error is raised. If a numeric value is given, FitFailedWarning is raised. This parameter does not affect the refit step, which will always raise the error.

UPDATE: newer versions of sklearn print out a bunch of ConvergenceWarning and FitFailedWarning. I had a hard time surppressing them with contextlib.suppress, but there is a hack around that involving a testing context manager:

Click to copy

from sklearn import svm, datasets 
from sklearn.utils._testing import ignore_warnings 
from sklearn.exceptions import FitFailedWarning, ConvergenceWarning 
from sklearn.model_selection import GridSearchCV 

with ignore_warnings(category=[ConvergenceWarning, FitFailedWarning]): 
    iris = datasets.load_iris() 
    parameters = {'dual':[True, False], 'penalty' : ['l1', 'l2'], \ 
                 'loss': ['hinge', 'squared_hinge']} 
    svc = svm.LinearSVC() 
    clf = GridSearchCV(svc, parameters, error_score=0.0) 
    clf.fit(iris.data, iris.target)

answered Sep 28 '22 01:09

crypdick

Related questions
                            
                                Python 2 - How would you round up/down to the nearest 6 minutes?
                            
                                Python using ZIP64 extensions when compressing large files
                            
                                Splitting columns of a numpy array easily
                            
                                Iterate over deque in python
                            
                                Using variables in the format() function in Python
                            
                                python pandas: how to find rows in one dataframe but not in another?
                            
                                How to run an function when anything changes in a dir with Python Watchdog?
                            
                                HTTPError: HTTP Error 503: Service Unavailable goslate language detection request : Python
                            
                                How to search for the last occurrence of a regular expression in a string in python?
                            
                                How can I dynamically render images from my images folder using Jinja and Flask?
                            
                                Viewing .npy images
                            
                                Using PythonService.exe to host python service while using virtualenv
                            
                                Python finding difference between two time stamps in minutes
                            
                                How to create a Manhattan plot with matplotlib in python?
                            
                                Fast Numpy Loops
                            
                                Is it possible to get the contents of an S3 file without downloading it using boto3?
                            
                                Unable to run a basic GraphFrames example
                            
                                Is there a keyboard shortcut in Pycharm for renaming a specific variable?
                            
                                How to embed matplotlib graph in Django webpage?
                            
                                How to dill (pickle) to file?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Skip forbidden parameter combinations when using GridSearchCV

Tags:

python

optimization

scikit-learn

svc

grid-search

crypdick

People also ask

1 Answers

crypdick

Recent Activity

Donate For Us