SGD model "overconfidence"

Question

I'm working on binary classification problem using Apache Mahout. The algorithm I use is OnlineLogisticRegression and the model which I currently have strongly tends to produce predictions which are either 1 or 0 without any middle values.

Please suggest a way to tune or tweak the algorithm to make it produce more intermediate values in predictions.

Thanks in advance!

Please suggest a way to tune or tweak the algorithm to make it produce more intermediate values in predictions.

Thanks in advance!

ogrisel · Accepted Answer

What is the test error rate of the classifier? If it's near zero then being confident is a feature, not a bug.

If the test error rate is high (or at least not low), then the classifier might be overfitting the training set: measure the difference between of the training error and the test error. In that case, increasing regularization as rrenaud suggested might help.

If your classifier is not overfitting, then there might be an issue with the probability calibration. Logistic Regression models (e.g. using the logit link function) should yield good enough probability calibrations (if the problem is approximately linearly separable and the label not too noisy). You can check the calibration of the probabilities with a plot as explained in this paper. If this is really a calibration issue, then implementing a custom calibration based on Platt scaling or isotonic regression might help fix the issue.

Rob Neuhaus · Answer

From reading the Mahout AbstractOnlineLogisticRegression docs, it looks like you can control the regularization parameter lambda. Increasing lambda should mean your weights are closer to 0, and hence your predictions are more hedged.

SGD model "overconfidence"

Tags:

machine-learning

classification

mahout

Alexander Oleynikov

2 Answers

ogrisel

Rob Neuhaus

Recent Activity

Donate For Us

SGD model "overconfidence"

Tags:

machine-learning

classification

mahout

Alexander Oleynikov

2 Answers

ogrisel

Rob Neuhaus

Related questions

Recent Activity

Donate For Us