 

RandomForestClassifier vs ExtraTreesClassifier in scikit-learn

Can anyone explain the difference between the RandomForestClassifier and ExtraTreesClassifier in scikit-learn? I've spent a good bit of time reading the paper:

P. Geurts, D. Ernst, and L. Wehenkel, "Extremely randomized trees", Machine Learning, 63(1), 3-42, 2006

It seems these are the differences for ET:

1) When choosing variables at a split, samples are drawn from the entire training set instead of a bootstrap sample of the training set.

2) Splits are chosen completely at random from the range of values in the sample at each split.

The result of these two things is many more "leaves".
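The two differences can be checked empirically. Below is a minimal sketch (on a synthetic dataset, with assumed hyperparameters) that fits both ensembles and counts leaves across the trees; with the defaults, the Extra-Trees' random cut-points typically produce noticeably larger trees:

```python
# Sketch: compare tree sizes of RandomForest vs Extra-Trees on toy data.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, ExtraTreesClassifier

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

rf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
et = ExtraTreesClassifier(n_estimators=100, random_state=0).fit(X, y)

# Total leaf count over all trees in each ensemble; Extra-Trees'
# random splits typically grow deeper, leafier trees.
rf_leaves = sum(tree.get_n_leaves() for tree in rf.estimators_)
et_leaves = sum(tree.get_n_leaves() for tree in et.estimators_)
print("RF leaves:", rf_leaves, "ET leaves:", et_leaves)
```

The exact counts depend on the data and random seed, so treat the numbers as illustrative rather than a fixed property.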

asked Mar 14 '14 by denson

People also ask

What is the difference between random forest and extra tree model from Sklearn?

Random forest uses bootstrap replicas, that is to say, it subsamples the input data with replacement, whereas Extra Trees use the whole original sample. In the Extra Trees sklearn implementation there is an optional parameter that allows users to bootstrap replicas, but by default, it uses the entire input sample.

What is the Randomforestclassifier model in Sklearn?

A random forest is a meta estimator that fits a number of decision tree classifiers on various sub-samples of the dataset and uses averaging to improve the predictive accuracy and control over-fitting.

What are extra trees?

Extra Trees is an ensemble machine learning algorithm that combines the predictions from many decision trees. It is related to the widely used random forest algorithm.

What are extremely randomized trees?

It essentially consists of randomizing strongly both attribute and cut-point choice while splitting a tree node. In the extreme case, it builds totally randomized trees whose structures are independent of the output values of the learning sample.


2 Answers

Yes, both conclusions are correct, although the Random Forest implementation in scikit-learn makes it possible to enable or disable the bootstrap resampling.

In practice, RFs are often more compact than ETs. ETs are generally cheaper to train from a computational point of view but can grow much bigger. ETs can sometimes generalize better than RFs, but it's hard to guess when that is the case without trying both first (and tuning n_estimators, max_features and min_samples_split by cross-validated grid search).
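The cross-validated grid search suggested above can be sketched as follows (the parameter values here are illustrative assumptions, not recommendations):

```python
# Sketch: tune n_estimators, max_features and min_samples_split by
# cross-validated grid search, as the answer suggests.
from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier
from sklearn.model_selection import GridSearchCV

X, y = make_classification(n_samples=300, n_features=20, random_state=0)

param_grid = {
    "n_estimators": [50, 100],
    "max_features": ["sqrt", None],
    "min_samples_split": [2, 10],
}
search = GridSearchCV(ExtraTreesClassifier(random_state=0), param_grid, cv=3)
search.fit(X, y)
print(search.best_params_)
```

The same grid works unchanged with `RandomForestClassifier`, which makes it easy to try both and compare their cross-validated scores.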

answered Sep 16 '22 by ogrisel

The ExtraTrees classifier always tests random splits over a fraction of the features (in contrast to RandomForest, which tests all possible splits over a fraction of the features).
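This shows up in the single-tree estimators the ensembles are built from: the extra-tree variant defaults to random splits, while an ordinary decision tree searches for the best split. A quick sketch:

```python
# Sketch: default split strategies of the underlying tree estimators.
from sklearn.tree import DecisionTreeClassifier, ExtraTreeClassifier

print(DecisionTreeClassifier().splitter)  # 'best'   -> exhaustive split search
print(ExtraTreeClassifier().splitter)     # 'random' -> random cut-points
```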

answered Sep 19 '22 by Muhammad Umar Amanat