The following is the AdaBoost algorithm:

1. Input: training examples $(x_1, y_1), \ldots, (x_N, y_N)$ with labels $y_i \in \{-1, +1\}$.
2. Initialize the weights $w_i = 1/N$ for $i = 1, \ldots, N$.
3. For $t = 1, \ldots, T$:
   3.1. Fit a weak classifier $h_t(x)$ using weights $w_i$ on the training data.
   3.2. Compute the weighted error $\epsilon_t = \sum_i w_i \, I(h_t(x_i) \neq y_i)$.
   3.3. Compute $\alpha_t = \frac{1}{2} \ln \frac{1 - \epsilon_t}{\epsilon_t}$.
   3.4. Update $w_i \leftarrow w_i \exp(-\alpha_t y_i h_t(x_i))$ and renormalize so that $\sum_i w_i = 1$.
4. Output the final classifier $H(x) = \operatorname{sign}\left( \sum_{t=1}^{T} \alpha_t h_t(x) \right)$.
It mentions "using weights wi on the training data" at part 3.1.
I am not very clear about how to use the weights. Should I resample the training data?
AdaBoost assigns a weight to each training example to determine its significance in the training set. Examples with high weights have a larger influence on the next classifier that is fit.
After each classifier is trained, the classifier's weight is calculated based on its accuracy. More accurate classifiers are given more weight. A classifier with 50% accuracy is given a weight of zero, and a classifier with less than 50% accuracy (kind of a funny concept) is given negative weight.
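To make this concrete, here is a small Python sketch (my own illustration, not from the original post) of the usual stage-weight formula $\alpha = \frac{1}{2} \ln \frac{1 - \epsilon}{\epsilon}$, where $\epsilon$ is the classifier's weighted error rate:

```python
import math

def stage_weight(error_rate):
    """Classifier (stage) weight in discrete AdaBoost."""
    return 0.5 * math.log((1.0 - error_rate) / error_rate)

print(stage_weight(0.10))  # accurate classifier   -> large positive weight
print(stage_weight(0.50))  # coin-flip classifier  -> 0.0
print(stage_weight(0.70))  # worse than chance     -> negative weight
```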
The outputs of the individual weak learners are combined into a weighted sum that forms the final output of the boosted classifier. This adaptive reweighting of examples and classifiers is why AdaBoost stands for "Adaptive Boosting".
Making Predictions with AdaBoost

Predictions are made by calculating the weighted average of the weak classifiers. For a new input instance, each weak learner calculates a predicted value as either +1.0 or -1.0. The predicted values are weighted by each weak learner's stage value.
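As a minimal sketch of this prediction step (the stage values below are made up purely for illustration):

```python
import numpy as np

# Hypothetical stage values and the +1/-1 outputs of three weak
# learners on one new input instance.
stage_values = np.array([0.9, 0.4, 0.7])
weak_outputs = np.array([+1.0, -1.0, +1.0])

score = np.dot(stage_values, weak_outputs)  # weighted sum = 1.2
prediction = np.sign(score)                 # final output = +1.0
print(score, prediction)
```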
> I am not clear on how to use these weights. Should I resample the training data?
It depends on what classifier you are using.
If your classifier can take instance weights (weighted training examples) into account, then you don't need to resample the data. Examples include a naive Bayes classifier that accumulates weighted counts, or a weighted k-nearest-neighbor classifier.
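For instance, a rough sketch with scikit-learn, assuming a classifier whose `fit()` accepts `sample_weight` (GaussianNB does):

```python
import numpy as np
from sklearn.naive_bayes import GaussianNB

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = np.where(X[:, 0] + X[:, 1] > 0, 1, -1)

# Instance weights as maintained by AdaBoost (uniform here,
# just to illustrate the interface).
w = np.full(len(X), 1.0 / len(X))

clf = GaussianNB()
clf.fit(X, y, sample_weight=w)  # weights used directly; no resampling needed
```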
Otherwise, you should resample the data according to the instance weights: instances with large weights may be sampled multiple times, while instances with very small weights might not appear in the resampled training set at all. Most other classifiers fall into this category.
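A sketch of such weighted resampling (the data here is synthetic, purely for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = np.where(X[:, 0] + X[:, 1] > 0, 1, -1)
w = rng.random(len(X))
w /= w.sum()  # instance weights from the boosting round, summing to 1

# Sample indices with probability proportional to the weights: heavy
# instances can appear several times, light ones may be dropped.
idx = rng.choice(len(X), size=len(X), replace=True, p=w)
X_res, y_res = X[idx], y[idx]
# Train any weight-unaware classifier on (X_res, y_res).
```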
Actually, in practice boosting performs better if you rely only on a pool of very naive classifiers, e.g., decision stumps or linear discriminants. In this case the algorithm you listed has an easy-to-implement form: at each round $t$, pick from the pool the weak classifier $h_t$ with the smallest weighted error $\epsilon_t$ ($\epsilon_t$ is defined similarly to yours), update the instance weights via $w_i \leftarrow w_i \exp(-\alpha_t y_i h_t(x_i))$ followed by renormalization, and output $H(x) = \operatorname{sign}\left( \sum_t \alpha_t h_t(x) \right)$, where $\alpha_t$ is chosen by $\alpha_t = \frac{1}{2} \ln \frac{1 - \epsilon_t}{\epsilon_t}$.
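One round of this form might look like the following sketch (my own Python; the pool is represented as a list of functions returning predictions in {-1, +1}):

```python
import numpy as np

def boost_round(pool, X, y, w):
    """One boosting round over a fixed pool of weak classifiers.

    pool: list of callables, each mapping X -> predictions in {-1, +1}
    y:    labels in {-1, +1};  w: instance weights summing to 1
    Returns the chosen classifier, its alpha, and the updated weights.
    """
    errors = np.array([np.sum(w * (h(X) != y)) for h in pool])
    best = int(np.argmin(errors))
    eps = float(np.clip(errors[best], 1e-12, 1 - 1e-12))  # numerical guard
    alpha = 0.5 * np.log((1.0 - eps) / eps)
    w = w * np.exp(-alpha * y * pool[best](X))
    return pool[best], alpha, w / w.sum()
```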
Define a two-class problem in the plane (for example, a circle of points inside a square) and build a strong classifier out of a pool of randomly generated linear discriminants of the type $\operatorname{sign}(a x_1 + b x_2 + c)$.
The two class labels are represented by red crosses and blue dots, and a set of linear discriminants (the yellow lines) forms the pool of naive/weak classifiers. We generate 1000 data points for each class (inside the circle or not), and 20% of the data is reserved for testing.
This is the classification result I got on the test dataset using 50 linear discriminants: the training error is 1.45% and the testing error is 2.3%.
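Here is a self-contained sketch of that experiment (the circle radius, pool size, and random seed are my own choices, so the exact error numbers will differ):

```python
import numpy as np

rng = np.random.default_rng(0)

# Two-class problem in the plane: +1 inside a circle, -1 elsewhere in the square.
pts = rng.uniform(-1.0, 1.0, size=(10000, 2))
inside = np.sum(pts**2, axis=1) < 0.5**2
X = np.vstack([pts[inside][:1000], pts[~inside][:1000]])
y = np.concatenate([np.ones(1000), -np.ones(1000)])

# Shuffle and reserve 20% of the data for testing.
perm = rng.permutation(len(X))
X, y = X[perm], y[perm]
n_train = int(0.8 * len(X))
X_tr, y_tr = X[:n_train], y[:n_train]
X_te, y_te = X[n_train:], y[n_train:]

# Pool of randomly generated linear discriminants sign(a*x1 + b*x2 + c).
coeffs = rng.normal(size=(500, 3))  # one (a, b, c) row per pool member

def pool_predict(X):
    """Predictions of every pool member on X; shape (pool size, n samples)."""
    raw = coeffs[:, :2] @ X.T + coeffs[:, 2:]
    return np.where(raw >= 0.0, 1.0, -1.0)

# Boost for 50 rounds, each round keeping the member with lowest weighted error.
w = np.full(n_train, 1.0 / n_train)
chosen, alphas = [], []
preds_tr = pool_predict(X_tr)
for _ in range(50):
    errors = (preds_tr != y_tr) @ w
    best = int(np.argmin(errors))
    eps = float(np.clip(errors[best], 1e-12, 1 - 1e-12))
    alpha = 0.5 * np.log((1.0 - eps) / eps)
    w *= np.exp(-alpha * y_tr * preds_tr[best])
    w /= w.sum()
    chosen.append(best)
    alphas.append(alpha)

def strong_predict(X):
    """Weighted vote of the 50 selected discriminants."""
    scores = np.array(alphas) @ pool_predict(X)[chosen]
    return np.where(scores >= 0.0, 1.0, -1.0)

print("training error:", np.mean(strong_predict(X_tr) != y_tr))
print("testing error: ", np.mean(strong_predict(X_te) != y_te))
```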