Access train and evaluation error in xgboost

Tags:

I started using python xgboost backage. Is there a way to get training and validation errors at each training epoch? I can't find one in the documentation

Have trained a simple model and got output:

[09:17:37] src/tree/updater_prune.cc:74: tree pruning end, 1 roots, 124 extra nodes, 0 pruned nodes, max_depth=6

[0] eval-rmse:0.407474 train-rmse:0.346349 [09:17:37] src/tree/updater_prune.cc:74: tree pruning end, 1 roots, 116 extra nodes, 0 pruned nodes, max_depth=6

1 eval-rmse:0.410902 train-rmse:0.339925 [09:17:38] src/tree/updater_prune.cc:74: tree pruning end, 1 roots, 124 extra nodes, 0 pruned nodes, max_depth=6

[2] eval-rmse:0.413563 train-rmse:0.335941 [09:17:38] src/tree/updater_prune.cc:74: tree pruning end, 1 roots, 126 extra nodes, 0 pruned nodes, max_depth=6

[3] eval-rmse:0.418412 train-rmse:0.333071 [09:17:38] src/tree/updater_prune.cc:74: tree pruning end, 1 roots, 114 extra nodes, 0 pruned nodes, max_depth=6

However I need to pass these eval-rmse and train-rmse further in code or at least plot these curves.

523

asked Feb 04 '16 09:02

MaxPY

2 Answers

One way to save your intermediate results is by passing evals_result argument to xgb.train method.

Let's say you have created a train and an eval matrix in XGB format, and have initialized some parameters params for XGBoost (In my case, params = {'max_depth':2, 'eta':1, 'silent':1, 'objective':'binary:logistic' }).

Create an empty dict

progress = dict()
Create a watchlist, (I guess you already have it given that you are printing train-rmse)

watchlist = [(train,'train-rmse'), (eval, 'eval-rmse')]
Pass these to xgb.train

bst = xgb.train(param, train, 10, watchlist, evals_result=progress)

At the end of iteration, the progress dictionary will contain the desired train/validation errors

> print progress
{'train-rmse': {'error': ['0.50000', ....]}, 'eval-rmse': { 'error': ['0.5000',....]}}

138

answered Sep 28 '22 19:09

Sudeep Juvekar

@MaxPY, this is in reply to your comment on Sudeep Juvekar's answer above: the keys for your progress dictionary is set to whatever string you pass as the second argument to the watchlist. For instance,

watchlist  = [(train,'train-rmse-demo'), (eval, 'eval-rmse-demo')]

sets the dictionary keys to train-rmse-demo and eval-rmse-demo

answered Sep 28 '22 17:09

Sunny Jha

Related questions
                            
                                Install psycopg2 for Anaconda Python
                            
                                SQLite 3 Database with Django
                            
                                Get consistent Key error: \n [duplicate]
                            
                                Load JPEG from URL to skimage without temporary file
                            
                                Is it possible to overload logical and in Python?
                            
                                Changing a class attribute within __init__
                            
                                Sorting and auto filtering Excel with openpyxl
                            
                                Building a Bootstrap table with dynamic elements in Flask
                            
                                NumPy sum along disjoint indices
                            
                                Plots are not visible using matplotlib plt.show()
                            
                                Embed "Bokeh created html file" into Flask "template.html" file
                            
                                How do I list contents of a gz file without extracting it in python?
                            
                                Matplotlib: how to plot with a specific hex color and a specific marker?
                            
                                pandas - check for non unique values in dataframe groupby
                            
                                Histogram fitting with python
                            
                                concordance for a phrase using NLTK in Python
                            
                                Python Doesn't Have Permission To Access On This Server / Return City/State from ZIP
                            
                                gcc error on install of tesseract-ocr
                            
                                How to calculate percent change compared to the beginning value using pandas?
                            
                                Smooth surface Plot with Pyplot

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Access train and evaluation error in xgboost

Tags:

python

machine-learning

xgboost

MaxPY

People also ask

2 Answers

Sudeep Juvekar

Sunny Jha

Recent Activity

Donate For Us