 

Scikit-learn Random Forest out of bag sample

I am trying to access the out-of-bag samples associated with each tree in a RandomForestClassifier, with no luck. I found other information, such as the Gini score and split feature for each node, by looking here: https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/tree/_tree.pyx

Does anyone know if it is possible to get the out-of-bag sample related to a tree? If not, is it possible to get the 'in bag' sample (the subset of the dataset used for a specific tree) and then compute the OOB sample from the original dataset?

Thanks in advance

asked Oct 22 '15 by wootwoot

People also ask

What is out of bag sample in random forest?

Out-of-bag (OOB) error, also called out-of-bag estimate, is a method of measuring the prediction error of random forests, boosted decision trees, and other machine learning models utilizing bootstrap aggregating (bagging). Bagging uses subsampling with replacement to create training samples for the model to learn from.
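
As a rough illustration of the idea (a minimal NumPy sketch, not tied to any particular library's internals), a bootstrap sample is drawn with replacement and the out-of-bag sample is simply everything that was never drawn:

import numpy as np

rng = np.random.RandomState(0)
n_samples = 10

# Bootstrap sample: draw n_samples indices with replacement.
in_bag = rng.randint(0, n_samples, n_samples)

# Out-of-bag sample: every index that was never drawn
# (on average roughly 1/e, i.e. about 37% of the rows).
oob = np.setdiff1d(np.arange(n_samples), in_bag)

print("in bag:     ", np.sort(in_bag))
print("out of bag: ", oob)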

How do I find out my Oob score?

Similarly, each of the OOB sample rows is passed through every decision tree that did not contain that row in its bootstrap training data, and a majority prediction is noted for each row. Lastly, the OOB score is computed as the fraction of correctly predicted rows in the out-of-bag sample.
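
For intuition, here is a toy sketch of that computation (the per-tree predictions are made-up numbers, just to show the majority vote and the final score; ties are broken towards class 0 for simplicity):

import numpy as np

# Toy setup: 3 trees, 5 samples, binary labels.
y = np.array([0, 1, 1, 0, 1])

# per_tree_pred[t, i] is tree t's prediction for sample i; np.nan means
# sample i was in tree t's bootstrap sample, so it gets no OOB vote.
per_tree_pred = np.array([
    [0,      np.nan, 1,      0,      np.nan],
    [np.nan, 1,      1,      np.nan, 0     ],
    [0,      1,      np.nan, 1,      1     ],
])

# Majority vote over the trees for which each sample was out of bag.
votes_for_1 = np.nansum(per_tree_pred, axis=0)
n_votes = np.sum(~np.isnan(per_tree_pred), axis=0)
majority = (votes_for_1 > n_votes / 2).astype(int)

# OOB score = fraction of rows predicted correctly by their OOB majority vote.
oob_score = np.mean(majority == y)
print(oob_score)  # 0.8 for this toy data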

Which sampling is used in random forest?

The random forest algorithm is made up of a collection of decision trees; each tree in the ensemble is built from a data sample drawn from the training set with replacement, called the bootstrap sample.

What is a good Oob score?

There's no such thing as a good oob_score on its own; it's the difference between the validation score and the oob_score that matters. Think of oob_score as a score on a subset (the OOB set) of the training set, built as described above.
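
In practice you can get both numbers directly from scikit-learn and compare them; a small sketch (the dataset here is synthetic, purely for illustration):

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, random_state=0)
X_train, X_valid, y_train, y_valid = train_test_split(X, y, random_state=0)

rf = RandomForestClassifier(n_estimators=200, oob_score=True, random_state=0)
rf.fit(X_train, y_train)

# The two scores should be in the same ballpark; the gap between them is
# what you actually want to look at, not the absolute oob_score_ value.
print("OOB score:       ", rf.oob_score_)
print("Validation score:", rf.score(X_valid, y_valid))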


1 Answer

You can figure this out yourself from the source code: look at how the private _set_oob_score method of the random forest works. Every tree estimator in scikit-learn has its own seed for the pseudo-random number generator, stored in the estimator.random_state field.

During the fit procedure, every estimator learns on a subset of the training set, and the indices of that subset are generated with a PRNG seeded from estimator.random_state.

This should work:

from sklearn.ensemble.forest import _generate_unsampled_indices
# X here - training set of examples
n_samples = X.shape[0]
for tree in rf.estimators_:
    # At each iteration we obtain the out-of-bag sample for one tree.
    unsampled_indices = _generate_unsampled_indices(
        tree.random_state, n_samples)
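
If you also want the OOB estimate the question asks about, here is a sketch that builds on the snippet above. It assumes X and y are NumPy arrays holding the training set and the same two-argument _generate_unsampled_indices as at the time of this answer; newer scikit-learn releases moved the helper to sklearn.ensemble._forest and added an n_samples_bootstrap argument, so adjust the import and call accordingly:

import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.ensemble.forest import _generate_unsampled_indices

# X, y - the training set, as above
rf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
n_samples = X.shape[0]

# Accumulate class-probability votes for each sample, using only the trees
# that did NOT see that sample during fitting.
votes = np.zeros((n_samples, rf.n_classes_))
for tree in rf.estimators_:
    oob = _generate_unsampled_indices(tree.random_state, n_samples)
    votes[oob] += tree.predict_proba(X[oob])

# A few samples may have been in-bag for every tree; skip them.
has_votes = votes.sum(axis=1) > 0
oob_pred = rf.classes_[votes.argmax(axis=1)]

# This is essentially what rf.oob_score_ reports when you fit with oob_score=True.
print(np.mean(oob_pred[has_votes] == y[has_votes]))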
answered Nov 15 '22 by Ibraim Ganiev