Which classification algorithm to choose?

Question

I would like to classify text documents into four categories. Also I have lot of samples which are already classified that can be used for training. I would like the algorithm to learn on the fly.. please suggest an optimal algorithm that works for this requirement.

Fred Foo · Accepted Answer

If by "on the fly" you mean online learning (where training and classification can be interleaved), I suggest the k-nearest neighbor algorithm. It's available in Weka and in the package TiMBL.

A perceptron will also be able to do this.

"Optimal" isn't a well-defined term in this context.

yura · Answer

there are several algorithms which can be learned on fly. Examples: k-nearest neighbors, naive Bayes, neural networks. You can try how appropriate each of these methods are on a sample corpus.

Which classification algorithm to choose?

Tags:

machine-learning

classification

data-mining

infotiger

2 Answers

Fred Foo

yura

Recent Activity

Donate For Us

Which classification algorithm to choose?

Tags:

machine-learning

classification

data-mining

infotiger

2 Answers

Fred Foo

yura

Related questions

Recent Activity

Donate For Us