Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Skip feature when classifying, but show feature in output

Tags:

weka

I've created a dataset which contains +/- 13000 rows with +/- 50 features. I know how to output every classification result: prediction and actual, but I would like to be able to output some sort of ID with those results. So i've added a ID column to my dataset but I don't know how disregard the ID when classifying while still being able to output the ID with every prediction result. I do know how to select features to output with every prediction.

like image 845
zeebonk Avatar asked Apr 06 '12 18:04

zeebonk


2 Answers

Use FilteredClassifier. See this and this .

like image 191
iinception Avatar answered Oct 14 '22 00:10

iinception


Let's say follwoing are the attributes in the bbcsport.arff that you want to remove and is in a file attributes.txt line by line..

serena
serve
service
sets
striking
tennis
tiebreak
tournaments
wimbledon
..
Here is how you may include or exclude the attributes by setting true or false. (mutually elusive) remove.setInvertSelection(false)

BufferedReader datafile = new BufferedReader(new FileReader("bbcsport.arff")); 
BufferedReader attrfile = new BufferedReader(new FileReader("attributes.txt"));

Instances data = new Instances(datafile); 
List<Integer> myList = new ArrayList<Integer>();
String line;

while ((line = attrfile.readLine()) != null) {
  for (n = 0; n < data.numAttributes(); n++) {
    if (data.attribute(n).name().equalsIgnoreCase(line)) {
      if(!myList.contains(n)) 
        myList.add(n); 
    } 
  }
}

int[] attrs = myList.stream().mapToInt(i -> i).toArray();
Remove remove = new Remove();
remove.setAttributeIndicesArray(attrs);
remove.setInvertSelection(false);
remove.setInputFormat(data); // init filter

Instances filtered = Filter.useFilter(data, remove);

'filtered' has the final attributes..

My blog .. http://ojaslabs.com/include-exclude-attributes-in-weka

like image 31
Sivakumar D Avatar answered Oct 14 '22 01:10

Sivakumar D