Trained Machine Learning model is too big

1 Answers

You can try using joblib with compression parameter.

from sklearn.externals import joblib
joblib.dump(your_algo, 'pickle_file_name.pkl', compress=3)

compress - from 0 to 9. Higher value means more compression, but also slower read and write times. Using a value of 3 is often a good compromise.

You can use python standard compression modules zlib, gzip, bz2, lzma and xz. To use that you can just specify the format with specific extension

Example:

joblib.dump(obj, 'your_filename.pkl.z')   # zlib

More information, see the link.

103

answered Oct 17 '22 05:10

Rajish sani

Related questions
                            
                                How to mutate a list with a function in python?
                            
                                What does the "verbosity" parameter of a random forest mean? (sklearn)
                            
                                How to give foreign key name in django
                            
                                Accessing MySQL from Python 3: Access denied for user
                            
                                Python ASCII codec can't encode character error during write to CSV
                            
                                Tensorflow successfully installs on mac but gets ImportError on copyreg when used [closed]
                            
                                Calculating pairwise correlation among all columns
                            
                                "Map" a nested list in Python
                            
                                nltk StanfordNERTagger : NoClassDefFoundError: org/slf4j/LoggerFactory (In Windows)
                            
                                How to get the entire web page source using Selenium WebDriver in python [duplicate]
                            
                                Self-signed SSL connection using PyMongo
                            
                                PySpark: filtering a DataFrame by date field in range where date is string
                            
                                PyCharm doesn't autocomplete Django model queries anymore in 2016.1.2
                            
                                Removing white space from txt with python
                            
                                Increment matplotlib color cycle
                            
                                Flask-sqlalchemy disable autoflush for the whole session
                            
                                Extracting Pylint Score
                            
                                Python: Accessing YAML values using "dot notation"
                            
                                pandas remove seconds from datetime index
                            
                                How to install numpy+mkl for python 2.7 on windows 64 bit?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Trained Machine Learning model is too big

Tags:

python

machine-learning

pickle

random-forest

Itack

People also ask

1 Answers

Rajish sani

Recent Activity

Donate For Us