How to access weighting of indiviual decision trees in xgboost?

Tags:

I'm using xgboost for ranking with

param = {'objective':'rank:pairwise', 'booster':'gbtree'}

As I understand gradient boosting works by calculating the weighted sum of the learned decision trees. How can I access the weights that are assigned to each learned booster? I wanted to try to post-process the weights after training to speed up the prediction step but I don't know how to get the individual weights. When using dump_model(), the different decision trees can be seen in the created file but no weighting is stored there. In the API I haven't found a suitable function. Or can I calculate the weights by hand with the shrinkage parameter eta ?

715

asked Oct 05 '15 14:10

саша

1 Answers

Each tree is given the same weight eta and the overall prediction is the sum of the predictions of each tree, as you say.

You'd perhaps expect that the earlier trees are given more weight than the latter trees but that's not necessary, due to the way the response is updated after every tree. Here's a toy example:

Suppose we have 5 observations, with responses 10, 20, 30, 40, 50. The first tree is built and gives predictions of 12, 18, 27, 39, 54.

Now, if eta = 1, the response variables passed to the next tree will be -2, 2, 3, 1, -4 (i.e. the difference between the prediction and the true response). The next tree will then try to learn the 'noise' that wasn't captured by the first tree. If nrounds = 2, then the sum of the predictions from the two trees will give the final prediction of the model.

If instead eta = 0.1, all trees will have their predictions scaled down by eta, so the first tree will instead 'predict' 1.2, 1.8, 2.7, 3.9, 5.4. The response variable passed to the next tree will then have values 8.8, 18.2, 27.3, 36.1, 44.6 (the difference between the scaled prediction and the true response) The second round then uses these response values to build another tree - and again the predictions are scaled by eta. So tree 2 predicts say, 7, 18, 25, 40, 40, which, once scaled, become 0.7, 1.8, 2.5, 4.0, 4.0. As before, the third tree will be passed the difference between these values and the previous tree's response variable (so 8.1, 16.4, 24.8, 32.1. 40.6). Again, the sum of the predictions from all trees will give the final prediction.

Clearly when eta = 0.1, and base_score is 0, you'll need at least 10 rounds to get a prediction that's anywhere near sensible. In general, you need an absolute minimum of 1/eta rounds and typically many more.

The rationale for using a small eta is that the model benefits from taking small steps towards the prediction rather than making tree 1 do the majority of the work. It's a bit like crystallisation - cool slowly and you get bigger, better crystals. The downside is you need to increase nrounds, thus increasing the runtime of the algorithm.

answered Sep 21 '22 14:09

dataShrimp

Related questions
                            
                                How to execute a python script and write output to txt file?
                            
                                Convert string into a function call
                            
                                Scan complete directory tree using pep8
                            
                                Divide an image into 5x5 blocks in python and compute histogram for each block
                            
                                Runtime error:App registry isn't ready yet
                            
                                How do I refer to the index of my Pandas dataframe?
                            
                                How do we delete a shape that's already been created in Tkinter canvas?
                            
                                python: How to use POS (part of speech) features in scikit learn classfiers (SVM) etc
                            
                                Apply styles while exporting to 'xlsx' in pandas with XlsxWriter
                            
                                flask-migrate doesn't detect models
                            
                                I can't install Gevent
                            
                                What does the "tk.call" function do in Python/Tkinter?
                            
                                How to vertically concatenate two arrays in Python? [duplicate]
                            
                                Creating classes with a lot of imported functions here and there
                            
                                Pandas: Always selecting the first sheet/tab in an Excel Sheet
                            
                                Find all local Maxima and Minima when x and y values are given as numpy arrays
                            
                                Create sample numpy array with randomly placed NaNs
                            
                                Seaborn distplot y-axis normalisation wrong ticklabels
                            
                                How do you implement token authentication in Flask?
                            
                                python 3.5 type hints: can i check if function arguments match type hints?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to access weighting of indiviual decision trees in xgboost?

Tags:

python

decision-tree

xgboost

boosting

саша

People also ask

1 Answers

dataShrimp

Recent Activity

Donate For Us