I am just trying my hand in automated model training via Amazon Autogluon based on Mxnet, especially, TabularPrediction. It is indeed training several models such as Catboost, LightGBM etc. based on the training data and dumping the models into a specified directory as pickle files.
However, the whole thing is still too much of a blackbox to me. In particular, I am looking for ways so that
Are these things possible? Some reference, assuming that Amazon does provide APIs for these would be greatly helpful.
Note: These answers apply to AutoGluon 0.0.9
Regarding adjustments to models, this can be done by specifying hyperparameters in the fit() call. The ability to adjust models post-fit is a feature which is not implemented yet, but is planned for future releases.
Regarding inspection of models: predictor.info() will show all model hyperparameters and various other information.
To export them as stand-alone models, call predictor.delete_models(models_to_keep='my_model', dry_run=False). This will delete all models except the particular model you wish to keep, effectively treating the predictor as a single model. Note that it still must use the predictor API, as the model requires data preprocessing present outside of its pickle file.
Regarding adjusting model weights, this is an open feature request that is intended to be added in a future release: https://github.com/awslabs/autogluon/issues/486
Regarding including custom models, the best way without extending AutoGluon at present is to treat the output of AutoGluon as a single model and utilize your own ensembling code. Alternatively, you can inherit from the AbstractModel class of AutoGluon to wrap your model, but this is intended for developers and is not well documented.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With