How to save a random forest in scikit-learn?

Tags:

python

scikit-learn

random-forest

There are already many questions about persistence, and I have tried a lot with pickle and joblib.dump. But when I use them to save my random forest, I get this:

ValueError: ("Buffer dtype mismatch, expected 'SIZE_t' but got 'long'", <type 'sklearn.tree._tree.ClassificationCriterion'>, (1, array([10])))

Can anyone tell me why?

Here is some code for review:

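from sklearn.ensemble import RandomForestClassifier
import cPickle  # Python 2 module; on Python 3 the equivalent is pickle

forest = RandomForestClassifier()
forest.fit(data[:n_samples], target[:n_samples])

# save the fitted forest
with open('rf.pkl', 'wb') as f:
    cPickle.dump(forest, f)

# load it back
with open('rf.pkl', 'rb') as f:
    forest = cPickle.load(f)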

or

from sklearn.externals import joblib

# save
joblib.dump(forest, 'rf.pkl')

# load
forest = joblib.load('rf.pkl')

420

asked Dec 22 '14 02:12

mrbean

2 Answers

This is caused by saving and loading the model with Python builds of different bitness (32-bit vs. 64-bit), as the related question "Scikits-Learn RandomForrest trained on 64bit python wont open on 32bit python" suggests.
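
To check whether the saving and loading environments differ in this way, one option is to print the pointer width of each interpreter and compare. The following is a minimal sketch using only the standard library; the helper names `interpreter_bits` and `describe_interpreter` are made up for illustration and are not part of scikit-learn or joblib.

```python
import platform
import struct
import sys


def interpreter_bits():
    """Return the pointer width of the running Python build (32 or 64)."""
    # struct.calcsize("P") is the size in bytes of a C pointer (void *).
    return struct.calcsize("P") * 8


def describe_interpreter():
    """Collect details that could be recorded next to a saved model file."""
    return {
        "bits": interpreter_bits(),
        "python_version": platform.python_version(),
        "machine": platform.machine(),
        # sys.maxsize exceeds 2**32 only on 64-bit builds.
        "is_64bit": sys.maxsize > 2**32,
    }


if __name__ == "__main__":
    # Run this on both the machine that saves and the one that loads.
    print(describe_interpreter())
```

If the two machines report different values for `bits`, that matches the situation described in this answer, and re-saving the model from an interpreter with the same bitness as the loading one is a reasonable thing to try.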

138

answered Sep 22 '22 04:09

xgdgsc

Try importing the joblib package directly instead of through sklearn.externals:

import joblib

# ...

# save
joblib.dump(rf, "some_path")

# load 
rf2 = joblib.load("some_path")

I've put the full working example with the code and comments here.
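
Separately from the example linked above (which is not reproduced here), a save/load round trip might look like the sketch below. It assumes recent versions of scikit-learn and joblib are installed; the synthetic dataset, the file name `rf.joblib`, and the forest parameters are placeholders chosen for illustration, not values from the question.

```python
import os
import tempfile

import joblib
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Synthetic data stands in for the asker's `data` and `target` (assumption).
X, y = make_classification(n_samples=200, n_features=8, random_state=0)

rf = RandomForestClassifier(n_estimators=10, random_state=0)
rf.fit(X, y)

# Save to a temporary directory and load the model back into a new object.
with tempfile.TemporaryDirectory() as tmp_dir:
    model_path = os.path.join(tmp_dir, "rf.joblib")  # hypothetical file name
    joblib.dump(rf, model_path)
    rf2 = joblib.load(model_path)

# The reloaded forest should reproduce the original predictions.
same_predictions = np.array_equal(rf.predict(X), rf2.predict(X))
print("Predictions match after reload:", same_predictions)
```

This only shows that the round trip works within one environment. Loading the file under a different Python bitness or a different scikit-learn version can still fail, as the other answer describes.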

answered Sep 22 '22 04:09

pplonski
