Saving StandardScaler() model for use on new datasets

Tags:

python-3.x

scikit-learn

How do I save a fitted StandardScaler() in scikit-learn? I need to make a model operational and don't want to reload the training data again and again just so StandardScaler can relearn its parameters before I apply it to the new data I want to make predictions on.

from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import train_test_split

#standardizing after splitting
X_train, X_test, y_train, y_test = train_test_split(data, target)
sc = StandardScaler()
X_train_std = sc.fit_transform(X_train)
X_test_std = sc.transform(X_test)


asked Nov 05 '18 10:11

Abhinav Bajpai

2 Answers

You can use joblib's dump function to save the fitted StandardScaler. Here's a complete example for reference.

from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import train_test_split
from sklearn.datasets import load_iris

data, target = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(data, target)

sc = StandardScaler()
X_train_std = sc.fit_transform(X_train)

To save the fitted scaler sc, use the following:

from joblib import dump, load
dump(sc, 'std_scaler.bin', compress=True)

This creates the file std_scaler.bin containing the fitted scaler.

To read the scaler back later, use load:

sc = load('std_scaler.bin')

Note: older code imports these functions from sklearn.externals.joblib, which is deprecated and has been removed from recent scikit-learn releases. Install and use the standalone joblib package instead.
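For anyone who wants to check the round trip end to end, here is a minimal sketch. It assumes the standalone joblib package is installed; the temporary directory and the names scaler_path, sc_loaded, same_stats and same_output are only illustrative and not part of the answer above.

```python
import os
import tempfile

import numpy as np
from joblib import dump, load
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

data, target = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(data, target, random_state=0)

# Fit once, on the training data only.
sc = StandardScaler().fit(X_train)

# Persist the fitted scaler (a temporary directory is used here for illustration).
scaler_path = os.path.join(tempfile.mkdtemp(), "std_scaler.bin")
dump(sc, scaler_path, compress=True)

# Later, for example in the serving process: reload and transform unseen data
# without touching the training set.
sc_loaded = load(scaler_path)
X_test_std = sc_loaded.transform(X_test)

# The reloaded scaler carries the same learned statistics as the original.
same_stats = np.allclose(sc.mean_, sc_loaded.mean_) and np.allclose(sc.scale_, sc_loaded.scale_)
same_output = np.allclose(sc.transform(X_test), X_test_std)
print(same_stats, same_output)
```

The point is that only transform is called after loading; fit is never run again, so the training data is not needed at prediction time.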


answered Oct 18 '22 20:10

sukhbinder

Or, if you prefer pickle:

import pickle

# The with blocks make sure the file handles are closed
with open('file/path/scaler.pkl', 'wb') as f:
    pickle.dump(sc, f)

with open('file/path/scaler.pkl', 'rb') as f:
    sc = pickle.load(f)
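One caveat that applies to both pickle and joblib: a scaler saved with one scikit-learn version is not guaranteed to load correctly under another, and pickle files should only be loaded from sources you trust. A rough sketch of one way to guard against the first problem is to store the library version next to the scaler. The payload dictionary, its keys, the tiny training array and the temporary path below are all hypothetical choices made for illustration.

```python
import os
import pickle
import tempfile

import numpy as np
import sklearn
from sklearn.preprocessing import StandardScaler

# Tiny made-up training set; column means are 2.0 and 20.0.
X_train = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]])
sc = StandardScaler().fit(X_train)

# Store the scaler together with the scikit-learn version that produced it.
payload = {"scaler": sc, "sklearn_version": sklearn.__version__}
path = os.path.join(tempfile.mkdtemp(), "scaler.pkl")
with open(path, "wb") as f:
    pickle.dump(payload, f)

# Later: load the payload and compare versions before using the scaler.
with open(path, "rb") as f:
    restored = pickle.load(f)

if restored["sklearn_version"] != sklearn.__version__:
    print("Warning: scaler was saved with scikit-learn", restored["sklearn_version"])

sc_loaded = restored["scaler"]

# A new sample equal to the training mean is mapped to zeros.
X_new = np.array([[2.0, 20.0]])
X_new_std = sc_loaded.transform(X_new)
print(X_new_std)
```

If the versions differ, refitting the scaler under the new version is the safest option.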

answered Oct 18 '22 22:10

Kevin Mc
