
How do you access tree depth in Python's scikit-learn?

I'm using scikit-learn to create a Random Forest, but I want to find the individual depth of each tree. It seems like a simple attribute to have, but according to the documentation (http://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html), there is no way of accessing it.

If this isn't possible, is there a way of accessing the tree depth from a Decision Tree model?

Any help would be appreciated. Thank you.

Asked Dec 11 '15 by iltp38

People also ask

How do I get depth in random forest Python?

Iterate over the forest's fitted trees and read each tree's actual depth, e.g. [est.tree_.max_depth for est in forest.estimators_]. Note that the estimator's own max_depth attribute is the maximum allowed depth of each tree in the forest, not the actual depth. So, for example, a random forest trained with max_depth=10 would simply return: [10, 10, 10, ...]

How do you choose the depth of a decision tree?

There is no theoretical calculation of the best depth of a decision tree, to the best of my knowledge. So here is what you do: choose a range of tree depths to loop over (try to cover the whole area, so include small ones as well as very big ones); inside the loop, divide your dataset into train/validation splits (e.g. 70%/30%), fit a tree of each depth on the training split, and keep the depth that scores best on the validation split.
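The loop above can be sketched as follows (a minimal illustration using scikit-learn's train_test_split and DecisionTreeClassifier; the dataset and the list of candidate depths are placeholders, not a recommendation):

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
# 70%/30% train/validation split, as suggested above
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.3, random_state=0)

# Try a range of depths, from very shallow to very deep
scores = {}
for depth in [1, 2, 3, 5, 8, 13, 21]:
    clf = DecisionTreeClassifier(max_depth=depth, random_state=0)
    clf.fit(X_train, y_train)
    scores[depth] = clf.score(X_val, y_val)  # validation accuracy

best_depth = max(scores, key=scores.get)
```

In practice you would likely repeat the split several times (or use cross-validation) so the chosen depth is not an artifact of one particular split.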

How do we select the depth of the trees in random forest?

Generally you want as many trees as will improve your model. The depth of the tree should be enough to split each node to your desired number of observations. There has been some work that says best depth is 5-8 splits. It is, of course, problem and data dependent.

What is a decision tree in Python sklearn?

How decision trees are created will be covered in a later article, because here we are more focused on the implementation of the decision tree in the Sklearn library of Python. The decision tree is a white-box model: we can easily trace any particular condition of the model that results in either true or false.

What is the advantage of using scikit-learn for decision trees?

Scikit-learn offers a more efficient implementation for the construction of decision trees. A naive implementation would recompute the class label histograms (for classification) or the means (for regression) from scratch for each new split point along a given feature.
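A toy illustration of the incremental idea (this is not scikit-learn's actual code): when candidate split points along one feature are scanned in sorted order, one sample at a time moves from the right side of the split to the left, so both class histograms can be updated in O(1) per split instead of being recounted:

```python
from collections import Counter

def split_histograms(feature_values, labels):
    """Yield (threshold, left_hist, right_hist) for each candidate split,
    updating the class-label histograms incrementally instead of recounting."""
    order = sorted(range(len(labels)), key=lambda i: feature_values[i])
    left = Counter()          # class counts left of the split (value <= threshold)
    right = Counter(labels)   # class counts right of the split
    for i in order[:-1]:      # the last point leaves nothing on the right
        left[labels[i]] += 1  # move one sample to the left side...
        right[labels[i]] -= 1 # ...and out of the right side
        yield feature_values[i], dict(left), dict(+right)  # +right drops zeros
```

For example, split_histograms([1.0, 2.0, 3.0], ['a', 'a', 'b']) yields the histograms for the splits after the first and second samples.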

How do I install scikit-learn in Python?

How Do You Install Scikit-Learn in Python? Installing Scikit-Learn can be done using either the pip package manager or the conda package manager. Simply run the appropriate command in your terminal and let the package manager handle the installation for you:
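The commands the snippet refers to are the standard ones (use whichever package manager matches your environment):

```shell
# with pip
pip install scikit-learn

# or with conda
conda install scikit-learn
```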

How does scikit learn classification work in Python?

In this section, we will learn how Scikit learn classification works in Python. Classification is a form of data analysis that extracts models describing important data classes: a classifier sorts samples into a set of predefined categories.
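For example, scikit-learn's classification_report summarizes per-class precision, recall, f1-score, and support for a fitted classifier (the dataset and model below are purely illustrative):

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Fit any classifier, then compare its predictions to the true labels
clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
y_pred = clf.predict(X_test)

report = classification_report(y_test, y_pred)
print(report)  # one row of precision/recall/f1/support per class
```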


1 Answer

Each instance of RandomForestClassifier has an estimators_ attribute, which is a list of DecisionTreeClassifier instances. The documentation shows that an instance of DecisionTreeClassifier has a tree_ attribute, which is an instance of the (undocumented, I believe) Tree class. Some exploration in the interpreter shows that each Tree instance has a max_depth parameter which appears to be what you're looking for -- again, it's undocumented.

In any case, if forest is your instance of RandomForestClassifier, then:

>>> [estimator.tree_.max_depth for estimator in forest.estimators_]
[9, 10, 9, 11, 9, 9, 11, 7, 13, 10]

should do the trick.

Each estimator also has a get_depth() method that can be used to retrieve the same value with briefer syntax:

>>> [estimator.get_depth() for estimator in forest.estimators_]
[9, 10, 9, 11, 9, 9, 11, 7, 13, 10]

To avoid confusion, it should be noted that each estimator (and not each estimator's tree_) also has an attribute called max_depth, which returns the setting of the parameter rather than the depth of the actual tree. How estimator.get_depth(), estimator.tree_.max_depth, and estimator.max_depth relate to each other is clarified in the example below:

from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

clf = RandomForestClassifier(n_estimators=3, random_state=4, max_depth=6)
iris = load_iris()
clf.fit(iris['data'], iris['target'])
[(est.get_depth(), est.tree_.max_depth, est.max_depth) for est in clf.estimators_]

Out:

[(6, 6, 6), (3, 3, 6), (4, 4, 6)] 

Setting max_depth to its default value None would allow the first tree to expand to depth 7, and the output would be:

[(7, 7, None), (3, 3, None), (4, 4, None)] 
Answered Oct 07 '22 by jme