How to get odds-ratios and other related features with scikit-learn

Tags: python, scikit-learn

I'm going through this tutorial on odds ratios in logistic regression and trying to get exactly the same results with the logistic regression module of scikit-learn. With the code below I can get the coefficient and intercept, but I could not find a way to get the other properties of the model listed in the tutorial, such as the log-likelihood, Odds Ratio, Std. Err., z, P>|z|, and [95% Conf. Interval]. If someone could show me how to calculate them with the sklearn package, I would appreciate it.

import pandas as pd
from sklearn.linear_model import LogisticRegression

url = 'https://stats.idre.ucla.edu/wp-content/uploads/2016/02/sample.csv'
df = pd.read_csv(url, na_values=[''])
y = df.hon.values                    # 1-D target; sklearn warns on a column vector
X = df.math.values.reshape(-1, 1)    # 2-D feature matrix with a single column
clf = LogisticRegression(C=1e5)      # large C makes the default L2 penalty negligible
clf.fit(X, y)
print(clf.coef_)
print(clf.intercept_)

485

asked Sep 21 '16 20:09

Erdem KAYA

2 Answers

You can get the odds ratios by exponentiating the coefficients:

import numpy as np
X = df.female.values.reshape(200,1)
clf.fit(X,y)
np.exp(clf.coef_)

# array([[ 1.80891307]])

As for the other statistics, these are not easy to get from scikit-learn, where model evaluation is mostly done using cross-validation. If you need them, you're better off using a different library such as statsmodels.
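If you have to stay with scikit-learn, the statistics can be derived by hand from the fitted coefficients. The following is only a rough sketch: `wald_summary` is a hypothetical helper, not part of scikit-learn, and it assumes the model was fitted with a negligible penalty (for example a very large `C`), so that the coefficients are close to the maximum-likelihood estimates. It computes the log-likelihood and the usual Wald statistics from the inverse Fisher information matrix.

```python
import math
import numpy as np

def wald_summary(X, y, intercept, coef, z_crit=1.959964):
    """Hypothetical helper: log-likelihood and Wald statistics for a
    logistic model whose coefficients are (close to) the unpenalized MLE."""
    y = np.asarray(y, dtype=float).ravel()
    X = np.asarray(X, dtype=float).reshape(len(y), -1)
    design = np.column_stack([np.ones(len(y)), X])      # prepend intercept column
    beta = np.concatenate([[float(intercept)], np.ravel(coef).astype(float)])

    p = 1.0 / (1.0 + np.exp(-design @ beta))             # fitted probabilities
    log_lik = float(np.sum(y * np.log(p) + (1 - y) * np.log(1 - p)))

    # Fisher information X' W X with W = diag(p * (1 - p));
    # its inverse approximates the covariance of the estimates.
    info = design.T @ (design * (p * (1 - p))[:, None])
    std_err = np.sqrt(np.diag(np.linalg.inv(info)))

    z = beta / std_err
    # Two-sided p-value from the standard normal distribution.
    p_value = np.array([math.erfc(abs(v) / math.sqrt(2)) for v in z])

    return {
        "log_likelihood": log_lik,
        "coef": beta,                     # first entry is the intercept
        "odds_ratio": np.exp(beta),
        "std_err": std_err,               # on the log-odds scale
        "z": z,
        "p_value": p_value,
        "ci_low": beta - z_crit * std_err,
        "ci_high": beta + z_crit * std_err,
    }
```

With the fitted model from the question, this would be called as `wald_summary(X, y, clf.intercept_[0], clf.coef_[0])`. Exponentiating `ci_low` and `ci_high` gives the interval on the odds-ratio scale. The results should only be expected to match the tutorial approximately, since scikit-learn still applies a small penalty.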

113

answered Sep 23 '22 18:09

maxymoo

In addition to @maxymoo's answer, the other statistics can be obtained with statsmodels. Assuming that you have your data in a DataFrame called df with columns label, age and gender, the code below should print a good summary:

import pandas as pd
from patsy import dmatrices
import statsmodels.api as sm 

y, X = dmatrices( 'label ~ age + gender', data=df, return_type='dataframe')
mod = sm.Logit(y, X)
res = mod.fit()
print(res.summary())
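The summary reports coefficients on the log-odds scale. To get odds ratios with their confidence intervals, the fitted parameters and interval bounds can be exponentiated. Below is a minimal sketch using synthetic data; the columns `hon` and `math` mirror the question, but the values are randomly generated stand-ins, not the tutorial's dataset, so the numbers will differ.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Synthetic stand-in data (assumption: not the real sample.csv).
rng = np.random.default_rng(0)
math_score = rng.normal(50, 10, 200)
prob = 1.0 / (1.0 + np.exp(-0.15 * (math_score - 55)))
df = pd.DataFrame({"math": math_score, "hon": rng.binomial(1, prob)})

res = smf.logit("hon ~ math", data=df).fit(disp=0)   # disp=0 silences the optimizer log

ci = np.exp(res.conf_int())          # 95% interval on the odds-ratio scale
table = pd.DataFrame({
    "odds_ratio": np.exp(res.params),
    "std_err": res.bse,              # standard error on the log-odds scale
    "z": res.tvalues,
    "p_value": res.pvalues,
    "ci_low": ci[0],
    "ci_high": ci[1],
})
print(table)
print("log-likelihood:", res.llf)
```

Note that `std_err` and `z` here refer to the coefficients, not to the odds ratios themselves.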

answered Sep 22 '22 18:09

Erdem KAYA
