
Feature importance using lightgbm

I am trying to run LightGBM for feature selection as below.

Initialization:

import numpy as np
import lightgbm as lgb
from sklearn.model_selection import train_test_split

# Initialize an empty array to hold feature importances
feature_importances = np.zeros(features_sample.shape[1])

# Create the model with several hyperparameters
model = lgb.LGBMClassifier(objective='binary',
                           boosting_type='goss',
                           n_estimators=10000,
                           class_weight='balanced')

Then I fit the model as below:

# Fit the model twice to avoid overfitting
for i in range(2):

    # Split into training and validation sets
    train_features, valid_features, train_y, valid_y = train_test_split(
        train_X, train_Y, test_size=0.25, random_state=i)

    # Train using early stopping
    model.fit(train_features, train_y, early_stopping_rounds=100,
              eval_set=[(valid_features, valid_y)],
              eval_metric='auc', verbose=200)

    # Record the feature importances
    feature_importances += model.feature_importances_

but I get the error below:

Training until validation scores don't improve for 100 rounds. 
Early stopping, best iteration is: [6]  valid_0's auc: 0.88648
ValueError: operands could not be broadcast together with shapes (87,) (83,) (87,) 
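
The shapes in the traceback suggest that the importance accumulator was sized from features_sample (87 columns) while the model was fitted on train_X (83 columns), so the two arrays cannot be added. A minimal sketch of the likely fix, assuming train_X is the matrix actually passed to fit():

import numpy as np

# Size the accumulator from the same matrix that is passed to fit(),
# so its length matches model.feature_importances_
feature_importances = np.zeros(train_X.shape[1])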
asked Nov 21 '18 by Ian Okeyo


People also ask

How does feature importance work in LightGBM?

We use StratifiedKFold to split our dataset into 5 folds, select one fold as the validation set, and train the model with early stopping using the remaining 4 folds as the training set. We then use this model to predict outcomes for the test set and record the predictions. This is repeated 5 times, so every fold serves as the validation set once.
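
A minimal sketch of that scheme, restricted to accumulating feature importances (X is assumed to be a pandas DataFrame of features and y a Series of binary labels; both names are placeholders). Recent LightGBM versions take early stopping as a callback; older versions accept early_stopping_rounds=100 in fit() instead.

import numpy as np
import lightgbm as lgb
from sklearn.model_selection import StratifiedKFold

def cv_feature_importances(X, y, n_splits=5):
    # Sum split-based importances over the folds, then average
    importances = np.zeros(X.shape[1])
    skf = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=0)
    for train_idx, valid_idx in skf.split(X, y):
        model = lgb.LGBMClassifier(objective='binary', n_estimators=10000)
        model.fit(X.iloc[train_idx], y.iloc[train_idx],
                  eval_set=[(X.iloc[valid_idx], y.iloc[valid_idx])],
                  eval_metric='auc',
                  callbacks=[lgb.early_stopping(100)])
        importances += model.feature_importances_
    return importances / n_splits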

How do you determine the feature important in a decision tree?

Feature importance is calculated as the decrease in node impurity weighted by the probability of reaching that node. The node probability can be calculated by the number of samples that reach the node, divided by the total number of samples. The higher the value the more important the feature.
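
As a small, self-contained illustration (dataset and tree depth chosen arbitrarily), scikit-learn exposes exactly this impurity-weighted measure as feature_importances_ on a fitted tree:

from sklearn.datasets import load_breast_cancer
from sklearn.tree import DecisionTreeClassifier

data = load_breast_cancer()
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(data.data, data.target)

# feature_importances_ sums, per feature, the impurity decrease at each split
# weighted by the fraction of samples reaching that node, then normalizes
for name, imp in sorted(zip(data.feature_names, tree.feature_importances_),
                        key=lambda t: -t[1])[:5]:
    print(f"{name}: {imp:.3f}")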

What is gain in LightGBM?

In LightGBM, the information gain is basically the difference between the entropy before and after a split. Entropy is a measure of uncertainty or randomness: the more randomness a variable has, the higher its entropy.
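
As a toy illustration of that definition (the counts are invented for the example), the gain of a split can be computed directly from the binary entropy:

import numpy as np

def entropy(p_pos):
    # Binary entropy in bits; 0 when the node is pure
    if p_pos in (0.0, 1.0):
        return 0.0
    return -(p_pos * np.log2(p_pos) + (1 - p_pos) * np.log2(1 - p_pos))

# Parent node: 10 positives, 10 negatives -> entropy 1.0
parent = entropy(0.5)
# Split into a pure left child (8 pos, 0 neg) and a right child (2 pos, 10 neg)
left, right = entropy(1.0), entropy(2 / 12)
# Information gain = parent entropy - weighted average of child entropies
gain = parent - (8 / 20 * left + 12 / 20 * right)
print(round(gain, 3))  # roughly 0.61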


1 Answer

An example of getting feature importance in LightGBM when using lgb.train (which returns a Booster):

import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns
import warnings
warnings.simplefilter(action='ignore', category=FutureWarning)

def plotImp(model, X, num=20, fig_size=(40, 20)):
    # Booster.feature_importance() returns one value per column of X
    feature_imp = pd.DataFrame({'Value': model.feature_importance(), 'Feature': X.columns})
    plt.figure(figsize=fig_size)
    sns.set(font_scale=5)
    sns.barplot(x="Value", y="Feature",
                data=feature_imp.sort_values(by="Value", ascending=False)[0:num])
    plt.title('LightGBM Features (avg over folds)')
    plt.tight_layout()
    plt.savefig('lgbm_importances-01.png')
    plt.show()
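
A hedged usage sketch (X_train and y_train are placeholders, not defined in the answer): the function expects the Booster returned by lgb.train together with the feature DataFrame.

import lightgbm as lgb

# X_train: pandas DataFrame of features, y_train: matching labels (placeholders)
dtrain = lgb.Dataset(X_train, label=y_train)
booster = lgb.train({'objective': 'binary'}, dtrain, num_boost_round=100)

plotImp(booster, X_train, num=20)

Note that model.feature_importance() is the Booster method; the sklearn-style LGBMClassifier used in the question exposes the same numbers as the feature_importances_ attribute instead.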
answered Oct 24 '22 by rosefun