
sklearn.ensemble.AdaBoostClassifier cannot accept SVM as base_estimator?

I am doing a text classification task. I want to use ensemble.AdaBoostClassifier with LinearSVC as the base_estimator. However, when I try to run the following code:

clf = AdaBoostClassifier(svm.LinearSVC(), n_estimators=50, learning_rate=1.0, algorithm='SAMME.R')
clf.fit(X, y)

An error occurred:

TypeError: AdaBoostClassifier with algorithm='SAMME.R' requires that the weak learner supports the calculation of class probabilities with a predict_proba method

The first question is: can svm.LinearSVC() not calculate class probabilities? How can I make it calculate them?

Then I changed the algorithm parameter and ran the code again:

clf = AdaBoostClassifier(svm.LinearSVC(), n_estimators=50, learning_rate=1.0, algorithm='SAMME')
clf.fit(X, y)

This time a different error happens: TypeError: fit() got an unexpected keyword argument 'sample_weight'. The AdaBoostClassifier documentation says of sample weights: "If None, the sample weights are initialized to 1 / n_samples." But even when I assign an integer to n_samples, the error still occurs.

The second question is: what does n_samples mean, and how can I solve this problem?

I hope someone can help me.

Following @jme's comment, I then tried

clf = AdaBoostClassifier(svm.SVC(kernel='linear', probability=True), n_estimators=10, learning_rate=1.0, algorithm='SAMME.R')
clf.fit(X, y)

The program never produces a result, and the memory used on the server stays unchanged.

The third question is: how can I make AdaBoostClassifier work with SVC as the base_estimator?

asked Nov 24 '14 by allenwang

People also ask

Can we use AdaBoost for regression?

AdaBoost algorithms can be used for both classification and regression problems.

Can you use AdaBoost with random forest?

Models trained using either Random Forest or the AdaBoost classifier make predictions that generalize better to a larger population. Models trained with both algorithms are less susceptible to overfitting / high variance.

How can I improve my AdaBoost?

Explore the number of trees. An important hyperparameter for AdaBoost is n_estimators. Often, by changing the number of base models (weak learners), we can adjust the accuracy of the model. The number of trees added to the model must be high for the model to work well, often hundreds, if not thousands.
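As a rough sketch of that tuning advice (using synthetic stand-in data from sklearn.datasets.make_classification, since no dataset is given here), one might cross-validate a few values of n_estimators and watch where accuracy plateaus:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import cross_val_score

# Stand-in data; substitute your own X, y.
X, y = make_classification(n_samples=300, random_state=0)

# Cross-validate a few settings of n_estimators (default base
# estimator is a decision stump) to see where accuracy levels off.
for n in (10, 50, 100):
    clf = AdaBoostClassifier(n_estimators=n, random_state=0)
    score = cross_val_score(clf, X, y, cv=3).mean()
    print(n, round(score, 3))
```

In practice the useful range depends heavily on the data and the base estimator, so treat the sweep values above as placeholders.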


1 Answer

The right answer will depend on exactly what you're looking for. LinearSVC cannot predict class probabilities (required by the default 'SAMME.R' algorithm used by AdaBoostClassifier) and does not support sample_weight.
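If you want to check programmatically whether an estimator could work as a SAMME weak learner, one rough way (an illustrative sketch, not part of the original answer) is to inspect its fit signature for a sample_weight parameter; note that newer scikit-learn releases have added sample_weight support to estimators that lacked it when this question was asked:

```python
import inspect

from sklearn.svm import LinearSVC, SVC
from sklearn.tree import DecisionTreeClassifier


def supports_sample_weight(estimator):
    # AdaBoost's SAMME algorithm reweights samples each round,
    # so the base estimator's fit() must accept sample_weight.
    return "sample_weight" in inspect.signature(estimator.fit).parameters


for est in (LinearSVC(), SVC(), DecisionTreeClassifier()):
    print(type(est).__name__, supports_sample_weight(est))
```

Whether LinearSVC prints True here depends on your scikit-learn version.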

You should be aware that a Support Vector Machine does not natively predict class probabilities. They are computed using Platt scaling (or an extension of Platt scaling in the multi-class case), a technique which has known issues. If you need less "artificial" class probabilities, an SVM might not be the way to go.
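To make that concrete, here is a small sketch (with make_classification stand-in data, since the question's X, y aren't shown) that prints the raw SVM margins next to the Platt-scaled probabilities that probability=True produces. The internal cross-validation used to fit the Platt scaler is also why training with probability=True is noticeably slower:

```python
from sklearn.datasets import make_classification
from sklearn.svm import SVC

# Stand-in data; substitute your own X, y.
X, y = make_classification(n_samples=200, random_state=0)

# probability=True turns on Platt scaling, fitted via an internal
# cross-validation on top of the usual SVM fit.
clf = SVC(kernel='linear', probability=True, random_state=0).fit(X, y)

print(clf.decision_function(X[:3]))  # raw SVM margins
print(clf.predict_proba(X[:3]))      # Platt-scaled probabilities
```

One known quirk: because the probabilities come from a separate model fitted on top of the margins, predict_proba can occasionally disagree with predict.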

With that said, I believe the most satisfying answer given your question would be that given by Graham. That is,

from sklearn.svm import SVC
from sklearn.ensemble import AdaBoostClassifier

clf = AdaBoostClassifier(SVC(probability=True, kernel='linear'), ...)

You have other options. You can use SGDClassifier with a hinge loss function and set AdaBoostClassifier to use the SAMME algorithm (which does not require a predict_proba function, but does require support for sample_weight):

from sklearn.linear_model import SGDClassifier

clf = AdaBoostClassifier(SGDClassifier(loss='hinge'), algorithm='SAMME', ...)

Perhaps the best answer would be to use a classifier that has native support for class probabilities, like Logistic Regression, if you want to use the default algorithm provided for AdaBoostClassifier. You can do this using sklearn.linear_model.LogisticRegression, or using SGDClassifier with a log loss function, as in the code provided by Kris.
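For example, a minimal sketch (again with stand-in data from make_classification) using LogisticRegression as the base estimator, which natively supports both predict_proba and sample_weight:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.linear_model import LogisticRegression

# Stand-in data; substitute your own X, y.
X, y = make_classification(n_samples=200, random_state=0)

# LogisticRegression supports both predict_proba and sample_weight,
# so it works with AdaBoostClassifier without any workaround.
clf = AdaBoostClassifier(LogisticRegression(max_iter=1000), n_estimators=10)
clf.fit(X, y)
print(clf.score(X, y))
```

The base estimator is passed positionally here so the snippet works across scikit-learn versions that renamed the base_estimator parameter to estimator.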

Hope that helps. If you're curious about what Platt scaling is, check out the original paper by John Platt here.

answered Sep 22 '22 by kevin