How to get the MSE of the node in the DecisionTreeRegressor of scikit-learn?

Tags:

In the generated decision tree regression model, there is an MSE attribute when using graphviz to view the tree structure. I need to obtain the MSE of each leaf node, and carry out subsequent operations according to the MSE. However, after reading the document, I can't find the method to provide for output MSE. Other attributes such as feature name, sample number, prediction value, etc. All have are corresponding methods:

Tree structure

With help(sklearn.tree._tree.Tree), I can see that most of the attributes have some methods to output the value, but I don't see anything about MSE.

Help on class Tree in module sklearn.tree._tree

583

asked Dec 17 '19 13:12

Cosmic Roach

1 Answers

Nice question. You need tree_reg.tree_.impurity.

Short answer:

tree_reg = tree.DecisionTreeRegressor(max_depth=2)
tree_reg.fit(X_train, y_train)

extracted_MSEs = tree_reg.tree_.impurity # The Hidden magic is HERE

for idx, MSE in enumerate(tree_reg.tree_.impurity):
    print("Node {} has MSE {}".format(idx,MSE))

Node 0 has MSE 86.873403833
Node 1 has MSE 40.3211827171
Node 2 has MSE 25.6934820064
Node 3 has MSE 19.0053469592
Node 4 has MSE 74.6839429717
Node 5 has MSE 38.3057346817
Node 6 has MSE 39.6709615385

Long answer using the `boston` dataset with visual output:

import pandas as pd
import numpy as np
from sklearn import ensemble, model_selection, metrics, datasets, tree
import graphviz

house_prices = datasets.load_boston()

X_train, X_test, y_train, y_test = model_selection.train_test_split(
    pd.DataFrame(house_prices.data, columns=house_prices.feature_names),
    pd.Series(house_prices.target, name="med_price"),
    test_size=0.20, random_state=42)

tree_reg = tree.DecisionTreeRegressor(max_depth=2)
tree_reg.fit(X_train, y_train)

extracted_MSEs = tree_reg.tree_.impurity # YOU NEED THIS 
print(extracted_MSEs)
#[86.87340383 40.32118272 25.69348201 19.00534696 74.68394297 38.30573468 39.67096154]

# Compare visually
dot_data = tree.export_graphviz(tree_reg, out_file=None, feature_names=X_train.columns)
graph = graphviz.Source(dot_data)

#this will create an boston.pdf file with the rule path
graph.render("boston")

Compare MSE values with visual Output:

enter image description here

186

answered Sep 29 '22 00:09

seralouk

Related questions
                            
                                What are the Tensorflow qint8, quint8, qint32, qint16, and quint16 datatypes?
                            
                                impossible to catch asyncio.TimeoutError?
                            
                                How to sort a list by length and then in reverse alphabetical order
                            
                                Intel MKL FATAL ERROR: Cannot load mkl_intel_thread.dll
                            
                                What solver should I use if my objective function is an nonlinear (also exponential explanation) function? Python GEKKO
                            
                                How do I count letters in a string?
                            
                                Cannot Import Name 'keras_export' From 'tensorflow.python.util.tf_export'
                            
                                How do I pass a keyword argument to the forward used by a pre-forward hook?
                            
                                Why does reading a whole file take up more RAM than its size on DISK?
                            
                                Add keys to a dictionary with automatically incremented values
                            
                                How can I cancel an active boto3 s3 file_download?
                            
                                Which SSIM is correct : skimage.metrics.structural_similarity()?
                            
                                What exactly does pygame.init() do?
                            
                                Can I train a Tensorflow keras model with complex input/output?
                            
                                How to generate all possible combinations with a given condition to make it more efficient?
                            
                                How to use Font Awesome icons in python plotly dash
                            
                                Zero predictions despite masking support for zero-padded mini batch LSTM training in keras
                            
                                How do I crop an image using a binary mask image of the same picture to remove the background in python?
                            
                                Pandas Dataframe: Multiplying Two Columns
                            
                                How do I train gpt 2 from scratch?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to get the MSE of the node in the DecisionTreeRegressor of scikit-learn?

Tags:

python

scikit-learn

Cosmic Roach

People also ask

1 Answers

Short answer:

Long answer using the `boston` dataset with visual output:

seralouk

Recent Activity

Donate For Us

How to get the MSE of the node in the DecisionTreeRegressor of scikit-learn?

Tags:

python

scikit-learn

Cosmic Roach

People also ask

1 Answers

Short answer:

Long answer using the boston dataset with visual output:

seralouk

Related questions

Recent Activity

Donate For Us

Long answer using the `boston` dataset with visual output: