muti output regression in xgboost

2 Answers

My suggestion is to use sklearn.multioutput.MultiOutputRegressor as a wrapper of xgb.XGBRegressor. MultiOutputRegressor trains one regressor per target and only requires that the regressor implements fit and predict, which xgboost happens to support.

Click to copy

# get some noised linear data X = np.random.random((1000, 10)) a = np.random.random((10, 3)) y = np.dot(X, a) + np.random.normal(0, 1e-3, (1000, 3))  # fitting multioutputregressor = MultiOutputRegressor(xgb.XGBRegressor(objective='reg:linear')).fit(X, y)  # predicting print np.mean((multioutputregressor.predict(X) - y)**2, axis=0)  # 0.004, 0.003, 0.005

This is probably the easiest way to regress multi-dimension targets using xgboost as you would not need to change any other part of your code (if you were using the sklearn API originally).

However this method does not leverage any possible relation between targets. But you can try to design a customized objective function to achieve that.

194

answered Sep 22 '22 12:09

ComeOnGetMe

It generates warnings: reg:linear is now deprecated in favor of reg:squarederror, so I update an answer based on @ComeOnGetMe's

Click to copy

import numpy as np  import pandas as pd  import xgboost as xgb from sklearn.multioutput import MultiOutputRegressor  # get some noised linear data X = np.random.random((1000, 10)) a = np.random.random((10, 3)) y = np.dot(X, a) + np.random.normal(0, 1e-3, (1000, 3))  # fitting multioutputregressor = MultiOutputRegressor(xgb.XGBRegressor(objective='reg:squarederror')).fit(X, y)  # predicting print(np.mean((multioutputregressor.predict(X) - y)**2, axis=0))

Out:

Click to copy

[2.00592697e-05 1.50084441e-05 2.01412247e-05]

answered Sep 19 '22 12:09

ah bon

Related questions
                            
                                Python Implementation of OPTICS (Clustering) Algorithm
                            
                                What is Depth of a convolutional neural network?
                            
                                Early stopping with Keras and sklearn GridSearchCV cross-validation
                            
                                Why should we use Temperature in softmax? [closed]
                            
                                How do you read Tensorboard files programmatically?
                            
                                How to recognize rectangles in this image?
                            
                                What is the difference between reinforcement learning and deep RL?
                            
                                Best machine learning technique for matching product strings
                            
                                Distinguishing overfitting vs good prediction
                            
                                How to choose number of hidden layers and nodes in neural network? [closed]
                            
                                Which machine learning library to use [closed]
                            
                                Classifying Documents into Categories
                            
                                Keras flowFromDirectory get file names as they are being generated
                            
                                Recommended anomaly detection technique for simple, one-dimensional scenario?
                            
                                When should one use LinearSVC or SVC?
                            
                                How to engineer features for machine learning [closed]
                            
                                sklearn doesn't have attribute 'datasets'
                            
                                How to convert keras(h5) file to a tflite file?
                            
                                Python - How to intuit word from abbreviated text using NLP?
                            
                                Scikit-learn confusion matrix

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

muti output regression in xgboost

Tags:

machine-learning

random-forest

xgboost

user1782011

People also ask

2 Answers

ComeOnGetMe

ah bon

Recent Activity

Donate For Us