I am using scikit learn 0.15.2 for a multi-class classification problem. I was getting a lot of DeprecationWarnings as follows when following examples like: scikit 0.14 multi label metrics until I started to use the MultiLabelBinarizer: "DeprecationWarning: Direct support for sequence of sequences multilabel representation will be unavailable from version 0.17. Use sklearn.preprocessing.MultiLabelBinarizer to convert to a label indicator representation." However, I cannot find a way to get the classification report (with precision, recall, f-measure) to work with it, as i was previously possible as shown here: scikit 0.14 multi label metrics I tried to use inverse_transform as below, this gives a classification_report but also gives the warnings again, that from 0.17 this code will break. How can I get measures for a multi-class classification problem? Example code: <pre class="prettyprint"><code>import numpy as np from sklearn.multiclass import OneVsRestClassifier from sklearn.preprocessing import MultiLabelBinarizer from sklearn.svm import LinearSVC from sklearn.metrics import classification_report # Some simple data: X_train = np.array([[0,0,0], [0,0,1], [0,1,0], [1,0,0], [1,1,1]]) y_train = [[1], [1], [1,2], [2], [2]] # Use MultiLabelBinarizer and train a multi-class classifier: mlb = MultiLabelBinarizer(sparse_output=True) y_train_mlb = mlb.fit_transform(y_train) clf = OneVsRestClassifier(LinearSVC()) clf.fit(X_train, y_train_mlb) # classification_report, here I did not find a way to use y_train_mlb, # I am getting a lot of DeprecationWarnings predictions_test = mlb.inverse_transform(clf.predict(X_train)) print classification_report(y_train, predictions_test) # Predict new example: print mlb.inverse_transform(clf.predict(np.array([0,1,0]))) </code></pre>

It seems like you have to run your classification report with the binarized labels: <pre class="prettyprint"><code>print classification_report(y_train_mlb, clf.predict(X_train)) </code></pre>

Scikit multi-class classification metrics, classification report

Tags:

I am using scikit learn 0.15.2 for a multi-class classification problem. I was getting a lot of DeprecationWarnings as follows when following examples like: scikit 0.14 multi label metrics until I started to use the MultiLabelBinarizer:

"DeprecationWarning: Direct support for sequence of sequences multilabel representation will be unavailable from version 0.17. Use sklearn.preprocessing.MultiLabelBinarizer to convert to a label indicator representation."

However, I cannot find a way to get the classification report (with precision, recall, f-measure) to work with it, as i was previously possible as shown here: scikit 0.14 multi label metrics

I tried to use inverse_transform as below, this gives a classification_report but also gives the warnings again, that from 0.17 this code will break.

How can I get measures for a multi-class classification problem?

Example code:

import numpy as np
from sklearn.multiclass import OneVsRestClassifier
from sklearn.preprocessing import MultiLabelBinarizer
from sklearn.svm import LinearSVC
from sklearn.metrics import classification_report

# Some simple data:

X_train = np.array([[0,0,0], [0,0,1], [0,1,0], [1,0,0], [1,1,1]])
y_train = [[1], [1], [1,2], [2], [2]]

# Use MultiLabelBinarizer and train a multi-class classifier:

mlb = MultiLabelBinarizer(sparse_output=True)
y_train_mlb = mlb.fit_transform(y_train)

clf = OneVsRestClassifier(LinearSVC())
clf.fit(X_train, y_train_mlb)

# classification_report, here I did not find a way to use y_train_mlb, 
# I am getting a lot of DeprecationWarnings

predictions_test = mlb.inverse_transform(clf.predict(X_train))
print classification_report(y_train, predictions_test)

# Predict new example:

print mlb.inverse_transform(clf.predict(np.array([0,1,0])))

650

asked May 14 '15 22:05

tkja

1 Answers

It seems like you have to run your classification report with the binarized labels:

print classification_report(y_train_mlb, clf.predict(X_train))

answered Oct 21 '22 20:10

elachell

Related questions
                            
                                Replace slice of a numpy array with values from another array
                            
                                No module named main, wkhtmltopdf issue
                            
                                How to put Python 3.4 matplotlib in non-interactive mode?
                            
                                Equivalent of Python's dir function in PHP
                            
                                Using model inheritance and encounting by non-nullable field error
                            
                                Aligning Japanese characters in python
                            
                                Unit Test Behavior with Patch (Flask)
                            
                                value error happens when using GridSearchCV
                            
                                Dictionary __gt__ and __lt__ implementation
                            
                                np.exp much slower than np.e?
                            
                                How to draw cubic spline in matplotlib
                            
                                Scipy ConvexHull and QHull: rank/dimension is not maximal
                            
                                Python logging creates empty files
                            
                                Passing java object to python
                            
                                Detect if method is decorated before invoking it
                            
                                How to display pre-colored string with curses?
                            
                                How can I find Python methods without return statements?
                            
                                Convert Pandas TimeDelta to integer
                            
                                Is there a way of putting the Python Shell output in a tkinter window?
                            
                                Is it possible to speed up interactive IPython Notebook plots by not generating new figures every time?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Scikit multi-class classification metrics, classification report

Tags:

python

machine-learning

scikit-learn

scikits

tkja

People also ask

1 Answers

elachell

Recent Activity

Donate For Us