Here is an example that creates two data sets:
```python
from sklearn.linear_model import LogisticRegression
from sklearn.datasets import make_classification

# data set 1
X1, y1 = make_classification(n_classes=2, n_features=5, random_state=1)
# data set 2
X2, y2 = make_classification(n_classes=2, n_features=5, random_state=2)
```
I want to use the `LogisticRegression` estimator with the same parameter values to fit a classifier on each data set:
```python
lr = LogisticRegression()
clf1 = lr.fit(X1, y1)
clf2 = lr.fit(X2, y2)

print("Classifier for data set 1:")
print(" - intercept:", clf1.intercept_)
print(" - coef_:", clf1.coef_)
print("Classifier for data set 2:")
print(" - intercept:", clf2.intercept_)
print(" - coef_:", clf2.coef_)
```
The problem is that both classifiers are the same object (`fit` returns `self`, so the second call overwrites the first fit):
```
Classifier for data set 1:
 - intercept: [ 0.05191729]
 - coef_: [[ 0.06704494  0.00137751 -0.12453698 -0.05999127  0.05798146]]
Classifier for data set 2:
 - intercept: [ 0.05191729]
 - coef_: [[ 0.06704494  0.00137751 -0.12453698 -0.05999127  0.05798146]]
```
For this simple example, I could use something like:
```python
lr1 = LogisticRegression()
lr2 = LogisticRegression()
clf1 = lr1.fit(X1, y1)
clf2 = lr2.fit(X2, y2)
```
to avoid the problem. However, the question remains: how do I duplicate/copy an estimator, with its particular parameter values, in general?
The `fit()` method takes the training data as arguments: one array in the case of unsupervised learning, or two arrays in the case of supervised learning. Note that the model is fitted using `X` and `y`, but the fitted object holds no reference to `X` and `y`; it only stores the learned parameters.
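A minimal sketch illustrating this: after a supervised `fit(X, y)`, the estimator exposes learned attributes (which by scikit-learn convention end in an underscore) but keeps no copy of the training arrays.

```python
from sklearn.linear_model import LogisticRegression
from sklearn.datasets import make_classification

X, y = make_classification(n_classes=2, n_features=5, random_state=1)

lr = LogisticRegression()
lr.fit(X, y)  # supervised learning: two arrays

# Learned parameters are stored on the estimator...
print(lr.coef_.shape)   # one row of coefficients per class pair, one column per feature

# ...but the training data itself is not retained as an attribute.
print(hasattr(lr, "X"), hasattr(lr, "y"))
```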
`clone` does a deep copy of the model in an estimator without actually copying attached data. It returns a new estimator with the same parameters that has not been fitted on any data. Parameters: `estimator` — a single estimator instance, or a `{list, tuple, set}` of estimator instances.
Estimator objects — fitting data: the main API implemented by scikit-learn is that of the estimator. An estimator is any object that learns from data; it may be a classification, regression or clustering algorithm, or a transformer that extracts/filters useful features from raw data.
```python
from sklearn.base import clone

lr1 = LogisticRegression()
lr2 = clone(lr1)
```
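Putting the pieces together, a short sketch (reusing the two data sets from above, with an arbitrary non-default `C` just to show that parameter values carry over) demonstrating that `clone` yields an unfitted copy with identical parameters, so the two fits no longer overwrite each other:

```python
from sklearn.base import clone
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
import numpy as np

X1, y1 = make_classification(n_classes=2, n_features=5, random_state=1)
X2, y2 = make_classification(n_classes=2, n_features=5, random_state=2)

lr1 = LogisticRegression(C=0.5)  # any particular parameter values
lr2 = clone(lr1)                 # new, unfitted estimator with the same parameters

clf1 = lr1.fit(X1, y1)
clf2 = lr2.fit(X2, y2)

# The hyperparameters match, but the fitted coefficients now differ,
# because clf1 and clf2 are distinct estimator objects.
print(lr1.get_params() == lr2.get_params())
print(clf1.coef_)
print(clf2.coef_)
```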