How to use RFE with xgboost Booster?

Tags:

I'm currently using xgb.train(...) which returns a booster but I'd like to use RFE to select the best 100 features. The returned booster cannot be used in RFE as it's not a sklearn estimator. XGBClassifier is the sklearn api into the xgboost library, however, I am not able to get the same results as with the xgb.train(...) method (10% worse on roc-auc). I've tried the sklearn boosters but they're not able to get similar results either. I've also tried to wrap the xgb.train(...) method in a class to add sklearn estimator methods but there's just too many to change. Is there some way to use the xgb.train(...) along with RFE from sklearn?

835

asked Feb 22 '21 01:02

pmdaly

1 Answers

For this kind of problem, I created shap-hypetune: a python package for simultaneous Hyperparameters Tuning and Features Selection for Gradient Boosting Models

In your case, this enables you to perform RFE with XGBClassifier in a very simple and intuitive way:

from shaphypetune import BoostRFE

model = BoostRFE(XGBClassifier(), min_features_to_select=1, step=1)
model.fit(X_train, y_train, eval_set=[(X_valid, y_valid)], early_stopping_rounds=6, verbose=0)

pred = model.predict(X_test)

As you can see, you can use all the fitting options available in the standard XGB API, like early_stopping_rounds or custom metrics, to customize the training process.

You can use shap-hypetune also to compute parameter tuning (also simultaneously with feature selection) or to compute feature selection with RFE or Boruta using SHAP feature importance. Full example available here

answered Sep 28 '22 03:09

Marco Cerliani

Related questions
                            
                                how to convert a dataframe of counts to a probability density function
                            
                                Prevent changing indentation from tabs to spaces
                            
                                WARNING:tensorflow:AutoGraph could not transform <function format_example at ...> and will run it as-is
                            
                                Atom: Can't search for Packages or Themes in the Install Packages section of Settings [closed]
                            
                                How to instal python packages for Spyder
                            
                                How to run compiled vue project in django
                            
                                BS4 replace_with result is no longer in tree
                            
                                Matplotlib, vertical space between legend symbols
                            
                                Apply function to each row in Pandas dataframe by group
                            
                                How to run a Julia file, which uses a package, in Python?
                            
                                How to Use Class Weights with Focal Loss in PyTorch for Imbalanced dataset for MultiClass Classification
                            
                                Alembic ignore specific tables
                            
                                Train and predict on variable length sequences
                            
                                How to make setup.py for standalone python application in a right way?
                            
                                Why does this solution work in Javascript but not in Python? (Dynamic programming)
                            
                                Hung cells: running multiple jupyter notebooks in parallel with papermill
                            
                                Cannot install Google Colab locally
                            
                                "Defaulting to user installation because normal site-packages is not writeable" python message
                            
                                Create Pandas DataFrame from space separated String
                            
                                Is it possible to get Dash-Cytoscape graph tooltips?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to use RFE with xgboost Booster?

Tags:

python

scikit-learn

feature-selection

xgboost

lightgbm

pmdaly

People also ask

1 Answers

Marco Cerliani

Recent Activity

Donate For Us