I have trained a LightGBM model and I would like to plot the learning curves. How can I do that? In Keras, for example, fit returns a History object containing the metrics, so that I can plot them once training is over. How is this task handled here?
My code is the following:
import lightgbm as lgb
from sklearn.metrics import roc_auc_score, precision_recall_fscore_support

def f_lgboost(data, params):
    model = lgb.LGBMClassifier(**params)
    X_train = data['X_train']
    y_train = data['y_train']
    X_dev = data['X_dev']
    y_dev = data['y_dev']
    X_test = data['X_test']
    # Mark the categorical columns so LightGBM handles them natively
    categorical_feature = ['Ticker_code', 'Category_code']
    X_train[categorical_feature] = X_train[categorical_feature].astype('category')
    X_dev[categorical_feature] = X_dev[categorical_feature].astype('category')
    X_test[categorical_feature] = X_test[categorical_feature].astype('category')
    feature_name = X_train.columns.to_list()
    model.fit(X_train, y_train, eval_set=[(X_dev, y_dev)], eval_metric='auc',
              early_stopping_rounds=20, categorical_feature=categorical_feature,
              feature_name=feature_name)
    y_pred_train = model.predict_proba(X_train)[:, 1].ravel()
    y_pred_dev = model.predict_proba(X_dev)[:, 1].ravel()
    auc_train = roc_auc_score(y_train, y_pred_train)
    auc_dev = roc_auc_score(y_dev, y_pred_dev)
    precision, recall, fscore, support = precision_recall_fscore_support(
        y_dev, (y_pred_dev > 0.5).astype(int), beta=0.5)
    y_pred_test = model.predict_proba(X_test)[:, 1].ravel()
    print(f'auc_train: {auc_train}, auc_dev: {auc_dev}, '
          f'precision: {precision}, recall: {recall}, fscore: {fscore}')
    Results = {
        'params': params,
        'data': data,
        'lg_boost_model': model,  # was `bst`, which is undefined in this scope
        'y_pred_train': y_pred_train,
        'y_pred_dev': y_pred_dev,
        'y_pred_test': y_pred_test,
        'auc_train': auc_train,
        'auc_dev': auc_dev,
        'precision_dev': precision,
        'recall_dev': recall,
        'fscore_dev': fscore,
        'support_dev': support
    }
    return Results
Coding an LGBM in Python
LightGBM can be installed with pip using the command "pip install lightgbm". It also provides a scikit-learn-compatible API through which both classification and regression models can be implemented, and both operate in a similar fashion.
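As a quick illustration of that shared interface, here is a minimal sketch (the synthetic datasets and hyperparameters are purely for demonstration) showing that the classifier and the regressor are driven the same way:

import lightgbm as lgb
from sklearn.datasets import make_classification, make_regression

# Both estimators follow the same scikit-learn fit/predict pattern
Xc, yc = make_classification(n_samples=500, random_state=0)
clf = lgb.LGBMClassifier(n_estimators=50).fit(Xc, yc)
print(clf.predict_proba(Xc[:3]))  # class probabilities

Xr, yr = make_regression(n_samples=500, random_state=0)
reg = lgb.LGBMRegressor(n_estimators=50).fit(Xr, yr)
print(reg.predict(Xr[:3]))        # continuous predictions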
Learning curves show how your chosen evaluation metric (e.g. RMSE, AUC, accuracy) evolves on your training and validation sets as training progresses, which for gradient boosting means as a function of the number of boosting iterations. They can be an extremely useful diagnostic tool, as they can tell you whether your model is suffering from high bias or high variance.
In the scikit-learn API, the learning curves are available via the attribute lightgbm.LGBMModel.evals_result_. It contains the metrics computed on the datasets specified in the eval_set argument of the fit method (so you would normally want to pass both the training and the validation sets there). There is also a built-in plotting function, lightgbm.plot_metric, which accepts either model.evals_result_ or the model directly.
Here is a complete minimal example:
import lightgbm as lgb
import matplotlib.pyplot as plt
import sklearn.datasets, sklearn.model_selection
# load_boston was removed in scikit-learn 1.2; the California housing data is a drop-in replacement
X, y = sklearn.datasets.fetch_california_housing(return_X_y=True)
X_train, X_val, y_train, y_val = sklearn.model_selection.train_test_split(X, y, random_state=7054)
model = lgb.LGBMRegressor(objective='mse', random_state=8798, n_jobs=1)
# `verbose` was removed from fit() in LightGBM 4.0; log progress via a callback instead
model.fit(X_train, y_train, eval_set=[(X_val, y_val), (X_train, y_train)],
          eval_names=['valid', 'train'], callbacks=[lgb.log_evaluation(period=10)])
lgb.plot_metric(model)
plt.show()
The resulting plot shows the l2 metric for both the training and validation sets as a function of the boosting iteration.
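If you prefer the Keras-style workflow from the question, where you grab the history after training and plot it yourself, evals_result_ is a plain dict keyed first by eval-set name and then by metric name, so you can feed it straight to matplotlib. A minimal sketch, assuming the model fitted in the example above (with eval_names=['valid', 'train']):

import matplotlib.pyplot as plt

history = model.evals_result_  # e.g. {'valid': {'l2': [...]}, 'train': {'l2': [...]}}
for eval_set_name, metrics in history.items():
    for metric_name, values in metrics.items():
        # one curve per (eval set, metric) pair, indexed by boosting iteration
        plt.plot(values, label=f'{eval_set_name} {metric_name}')
plt.xlabel('Boosting iteration')
plt.ylabel('Metric value')
plt.legend()
plt.show()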