What is the best way to save a sklearn model?

Tags: python, scikit-learn

I am working on a Python desktop app that makes predictions. Right now I train my sklearn model with a Python script and save the model's parameters as a dictionary in a YAML file, which I then bundle into the app. When the app runs, the model is recreated from the parameters in that dictionary. I realized that people who have a different version of sklearn get an error. I also tried saving the model in a pickle file, but that produced a warning when the app ran on a machine with a different version of sklearn.

943

asked Oct 05 '17 16:10

Ekaterina Tcareva

1 Answer

There is no guarantee that a given sklearn model will be compatible across versions of sklearn, because the implementation or the internal API may change between versions. See the scikit-learn documentation on model persistence for more information.
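Since compatibility across versions is not guaranteed, one option is to record the sklearn version at training time and compare it before loading the model in the app. The following is only a minimal sketch of that idea: the helper names `write_version_file` and `saved_version_matches` and the file name `model_meta.json` are made up for illustration and are not part of sklearn.

```python
import json
import os
import tempfile

import sklearn


def write_version_file(path):
    """Record the scikit-learn version used at training time (hypothetical helper)."""
    with open(path, "w") as f:
        json.dump({"sklearn_version": sklearn.__version__}, f)


def saved_version_matches(path):
    """Return True if the recorded version equals the installed one (hypothetical helper)."""
    with open(path) as f:
        saved = json.load(f)["sklearn_version"]
    return saved == sklearn.__version__


# Write the metadata file next to where the model would be stored.
# A temporary directory is used here only to keep the sketch self-contained.
meta_path = os.path.join(tempfile.mkdtemp(), "model_meta.json")
write_version_file(meta_path)

# In the app, run this check before unpickling the model.
print(saved_version_matches(meta_path))
```

If the check fails, the app can refuse to load the model or ask the user to install the matching version, instead of failing later with an obscure error.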

If you target a single version, the best way is indeed to pickle the model rather than save its parameters in a YAML file. Using joblib to do so is even better. The scikit-learn documentation on model persistence has an example.
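As a rough illustration of the joblib approach, here is a small round trip using `joblib.dump` and `joblib.load`. The toy data, the choice of `LogisticRegression`, and the file name `model.joblib` are assumptions made for the example and have nothing to do with the asker's actual model.

```python
import os
import tempfile

import joblib
import numpy as np
from sklearn.linear_model import LogisticRegression

# Toy training data, for illustration only.
X = np.array([[0.0], [1.0], [2.0], [3.0]])
y = np.array([0, 0, 1, 1])

model = LogisticRegression().fit(X, y)

# Save the fitted estimator to disk.
model_path = os.path.join(tempfile.mkdtemp(), "model.joblib")
joblib.dump(model, model_path)

# Load it back, ideally with the same sklearn version that saved it.
restored = joblib.load(model_path)

# The restored model should give the same predictions as the original.
same_predictions = np.array_equal(model.predict(X), restored.predict(X))
print(same_predictions)
```

Like pickle, joblib files can execute arbitrary code when loaded, so only load files from a source you trust.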

133

answered Oct 10 '22 02:10

TomDLT
