I'm trying to figure out which decision tree method from the scikit-learn package will better suit my needs for performing a classification task.
However, I found that there are two decision tree models available there:
- DecisionTreeClassifier
- ExtraTreeClassifier
Can anyone specify the advantages and disadvantages of using each of these models?
The main advantage of the decision tree classifier is its ability to use different feature subsets and decision rules at different stages of classification. A general decision tree consists of one root node, a number of internal and leaf nodes, and branches.
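For concreteness, here is a minimal sketch of fitting a single DecisionTreeClassifier and printing its learned structure and decision rules (the iris data and max_depth=3 are arbitrary choices for illustration):

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()

# Fit a single decision tree: each internal node tests one feature
# against a threshold; each leaf assigns a class.
clf = DecisionTreeClassifier(max_depth=3, random_state=0)
clf.fit(iris.data, iris.target)

print("depth:", clf.get_depth(), "leaves:", clf.get_n_leaves())
print(export_text(clf, feature_names=iris.feature_names))
```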
In terms of computational cost, and therefore execution time, the Extra Trees algorithm is faster. Training otherwise proceeds the same way as for a standard tree ensemble, but it chooses each split point at random instead of searching for the optimal one, which is what saves time.
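A rough way to see the speed difference is to time both ensembles on the same synthetic data; the exact numbers depend on your machine and dataset, so treat this as an illustrative sketch:

```python
import time

from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier, RandomForestClassifier

X, y = make_classification(n_samples=10_000, n_features=40, random_state=0)

# Extra Trees usually fits faster because it draws split thresholds
# at random instead of optimizing them at every node.
for Model in (RandomForestClassifier, ExtraTreesClassifier):
    start = time.perf_counter()
    Model(n_estimators=100, random_state=0).fit(X, y)
    print(f"{Model.__name__}: {time.perf_counter() - start:.2f}s")
```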
Extra Trees is an ensemble of decision trees, related to bagging and random forests. From the scikit-learn documentation for ExtraTreesClassifier: "This class implements a meta estimator that fits a number of randomized decision trees (a.k.a. extra-trees) on various sub-samples of the dataset and uses averaging to improve the predictive accuracy and control over-fitting."
ExtraTreeClassifier is an extremely randomized version of DecisionTreeClassifier, meant to be used internally as part of the ExtraTreesClassifier ensemble.
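You can verify this relationship on a fitted ensemble: the individual trees stored in its estimators_ attribute are ExtraTreeClassifier instances. A small sketch:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import ExtraTreesClassifier

X, y = load_iris(return_X_y=True)
ensemble = ExtraTreesClassifier(n_estimators=10, random_state=0).fit(X, y)

# The fitted ensemble keeps its member trees in estimators_,
# and each member is an ExtraTreeClassifier.
print(type(ensemble.estimators_[0]).__name__)  # -> ExtraTreeClassifier
```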
Averaging ensembles such as RandomForestClassifier and ExtraTreesClassifier are meant to tackle the variance problems (lack of robustness with respect to small changes in the training set) of individual DecisionTreeClassifier instances.
If your main goal is maximizing prediction accuracy, you should almost always use an ensemble of decision trees such as ExtraTreesClassifier (or alternatively a boosting ensemble) instead of training individual decision trees.
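A quick cross-validation comparison illustrates the point; with synthetic data and arbitrary parameters like these, the ensemble typically shows a higher mean score and a smaller spread across folds than a single tree, though results will vary:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=2_000, n_features=20, random_state=0)

for name, model in [
    ("single tree", DecisionTreeClassifier(random_state=0)),
    ("extra trees", ExtraTreesClassifier(n_estimators=100, random_state=0)),
]:
    scores = cross_val_score(model, X, y, cv=5)
    # Mean reflects accuracy; std across folds is a rough proxy
    # for the variance the ensemble is meant to reduce.
    print(f"{name}: {scores.mean():.3f} +/- {scores.std():.3f}")
```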
Have a look at the original Extra Trees paper (Geurts, Ernst & Wehenkel, "Extremely randomized trees", Machine Learning, 2006) for more details.