Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to get predication value for an instance in weka?

Tags:

java

weka

I am working on Weka and need to output the predication values (probabilities) of each labels for each test instance.

In GUI there is an option in classify tab as (classify -> options -> Output predicted value) which does this work by outputting the prediction probabilities for each label but how to do this in java code. I want to receive probability scores for each label after classifying it ?

like image 829
Kashif Khan Avatar asked Dec 16 '13 07:12

Kashif Khan


2 Answers

The following code takes in a set of training instances, and outputs the predicted probability for a specific instance.


import weka.classifiers.trees.J48;
import weka.core.Instances;

public class Main {

    public static void main(String[] args) throws Exception
    {
        //load training instances
        Instances test=...

        //build a J48 decision tree
        J48 model=new J48(); 
        model.buildClassifier(test);

        //decide which instance you want to predict
        int s1=2;

        //get the predicted probabilities 
        double[] prediction=model.distributionForInstance(test.get(s1));

        //output predictions
        for(int i=0; i<prediction.length; i=i+1)
        {
            System.out.println("Probability of class "+
                                test.classAttribute().value(i)+
                               " : "+Double.toString(prediction[i]));
        }

    }

}

The method "distributionForInstance" only works for classifiers capable of outputting distribution predictions. You can read up on it here.

like image 105
Walter Avatar answered Nov 16 '22 03:11

Walter


I think I found the solution.

The training set and the test set must be equal: same header, same name of attributes, same order. Only changes the numbers. And the question is: why do I have to put the class in the test set if I don’t know it, and precisely it is what I want to obtain? It seems that the method needs something on that, but when you have a look at classModel.distributionForInstance(dataModel.instance(0)) , it gives you the prediction on your classes with an array of double. So, it is necessary to put some values of the classes in the test set, and later the ‘distributionForInstance’ gives you the real result for your classes.

like image 27
Txus Lopez Avatar answered Nov 16 '22 04:11

Txus Lopez