I'm using scikit-learn's LogisticRegression for a multiclass problem.
logit = LogisticRegression(penalty='l1', solver='liblinear')  # 'l1' needs a solver that supports it, e.g. liblinear or saga
logit = logit.fit(X, y)
I'm interested in which features are driving this decision.
logit.coef_
The above gives me a nice array of shape (n_classes, n_features), but the class and feature names are gone. For features that's okay, because it seems safe to assume they're indexed in the same order I passed them in...
But with classes it's a problem, since I never explicitly passed the classes in any particular order. So which class do coefficient sets (rows of the array) 0, 1, 2, and 3 belong to?
Logistic regression is inherently a two-class model: the target is modeled with a binomial probability distribution, the class labels are mapped to 1 for the positive class and 0 for the negative class, and the fitted model predicts the probability that an example belongs to class 1. The coefficients live on the log-odds scale. In linear regression, if we were using GPA to predict test scores, a coefficient of 10 for GPA would mean that each one-point increase in GPA predicts a 10-point increase on the test; in logistic regression, a coefficient instead gives the change in the log odds of the positive class for a one-unit increase in that feature.
Logistic regression can still be applied to a problem with three or more classes: one common approach is the one-vs-rest scheme, which fits one binary classifier per class.
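To make the one-vs-rest idea concrete, here is a small sketch (class labels `'a'`, `'b'`, `'c'` and the random data are made up for illustration). Wrapping `LogisticRegression` in `OneVsRestClassifier` fits one binary model per class, and each sub-model's coefficients match what you'd get by fitting a plain binary model of that class against everything else:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier

rng = np.random.RandomState(0)
X = rng.rand(60, 4)
y = rng.choice(['a', 'b', 'c'], size=60)

# One-vs-rest: one binary logistic regression per class,
# sub-models stored in estimators_ in the order of classes_.
ovr = OneVsRestClassifier(LogisticRegression()).fit(X, y)

# The first sub-model is equivalent to a binary fit of
# "is it classes_[0]?" vs. everything else.
binary_a = LogisticRegression().fit(X, y == ovr.classes_[0])
print(np.allclose(ovr.estimators_[0].coef_, binary_a.coef_))
```

This is essentially what a multiclass `LogisticRegression` does internally when it uses the one-vs-rest strategy, which is why it ends up with one row of coefficients per class.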
The order will be the same as returned by logit.classes_
(classes_ is an attribute of the fitted model holding the unique classes present in y); for string labels they are sorted alphabetically.
To demonstrate, we fit a LogisticRegression on a random dataset with the string labels y below:
import numpy as np
from sklearn.linear_model import LogisticRegression
X = np.random.rand(45,5)
y = np.array(['GR3', 'GR4', 'SHH', 'GR3', 'GR4', 'SHH', 'GR4', 'SHH',
'GR4', 'WNT', 'GR3', 'GR4', 'GR3', 'SHH', 'SHH', 'GR3',
'GR4', 'SHH', 'GR4', 'GR3', 'SHH', 'GR3', 'SHH', 'GR4',
'SHH', 'GR3', 'GR4', 'GR4', 'SHH', 'GR4', 'SHH', 'GR4',
'GR3', 'GR3', 'WNT', 'SHH', 'GR4', 'SHH', 'SHH', 'GR3',
'WNT', 'GR3', 'GR4', 'GR3', 'SHH'], dtype=object)
lr = LogisticRegression()
lr.fit(X,y)
# This is what you want
lr.classes_
#Out:
# array(['GR3', 'GR4', 'SHH', 'WNT'], dtype=object)
lr.coef_
#Out:
# array of shape [n_classes, n_features]
So in the coef_
matrix, row index 0 represents 'GR3' (the first class in the classes_
array), row 1 represents 'GR4', and so on.
Hope it helps.