Simplest feature selection algorithm

Question

I am trying to create my own and simple feature selection algorithm. The data set that I am going to work with is here (very famous data set). Can someone give me a pointer on how to do so?

I am planning to write a feature rank algorithm for a text classification. This is for a sentiment analysis of movie reviews, classifying them as either positive or negative.

So my question is on how to write a simple feature selection for a text data set.

Xolve · Accepted Answer

Feature selection methods are a big topic. You can start with following:

Chi square
Mutual information
Term frequency

etc. Read this paper if you have time: Comparative study on feature selection in text categorization this will help you lot.

The actual implementation depends on how you pre-process the data. Basically its keeping the counts, be it hash table or a database.

Has QUIT--Anony-Mousse · Answer

Random features work well, when you are then building ensembles. It's known as feature bagging.

Simplest feature selection algorithm

Tags:

algorithm

machine-learning

data-mining

semantic-analysis

aherlambang

2 Answers

Xolve

Has QUIT--Anony-Mousse

Recent Activity

Donate For Us

Simplest feature selection algorithm

Tags:

algorithm

machine-learning

data-mining

semantic-analysis

aherlambang

2 Answers

Xolve

Has QUIT--Anony-Mousse

Related questions

Recent Activity

Donate For Us