Python statsmodels OLS: how to save learned model to file

Tags: python, statsmodels, least-squares

I am trying to learn an ordinary least squares model using Python's statsmodels library, as described here.

sm.OLS.fit() returns the learned model. Is there a way to save it to a file and reload it later? My training data is huge and fitting the model takes around half a minute, so I was wondering whether the OLS model has any save/load capability.

I tried the repr() method on the model object but it does not return any useful information.

asked May 07 '13 08:05

Nik

2 Answers

The models and results instances all have a save and load method, so you don't need to use the pickle module directly.

Edit to add an example:

import statsmodels.api as sm

data = sm.datasets.longley.load_pandas()

data.exog['constant'] = 1

results = sm.OLS(data.endog, data.exog).fit()
results.save("longley_results.pickle")

# we should probably add a generic load to the main namespace
from statsmodels.regression.linear_model import OLSResults
new_results = OLSResults.load("longley_results.pickle")

# or more generally
from statsmodels.iolib.smpickle import load_pickle
new_results = load_pickle("longley_results.pickle")

Edit 2: We've now added a load method to the main statsmodels API in master, so you can just do

new_results = sm.load('longley_results.pickle')

answered Oct 26 '22 04:10

jseabold

I've installed the statsmodels library and found that you can save the results using Python's pickle module.

Models and results are pickleable via save/load, optionally saving the model data. [source]

As an example:

Given that you have the results saved in the variable results:

To save the file:

import pickle
# pickle writes bytes, so open the file in binary mode ('wb')
with open('learned_model.pkl', 'wb') as f:
    pickle.dump(results, f)

To read the file:

import pickle
# read it back in binary mode ('rb') as well
with open('learned_model.pkl', 'rb') as f:
    model_results = pickle.load(f)
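One thing to watch for on Python 3: pickle produces bytes, so the file has to be opened in binary mode. The sketch below uses only the standard library and a plain dict as a hypothetical stand-in for the results object, just to show that text mode fails while binary mode round-trips.

```python
import os
import pickle
import tempfile

# Hypothetical stand-in for a fitted results object.
payload = {"params": [1.0, 2.0, -0.5]}

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "learned_model.pkl")

    # Text mode: pickle tries to write bytes to a str-only file object.
    try:
        with open(path, "w") as f:
            pickle.dump(payload, f)
        text_mode_failed = False
    except TypeError:
        text_mode_failed = True

    # Binary mode: write and read back without errors.
    with open(path, "wb") as f:
        pickle.dump(payload, f)
    with open(path, "rb") as f:
        restored = pickle.load(f)

print(text_mode_failed, restored == payload)
```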

answered Oct 26 '22 04:10

RMcG
