I am trying to solve a classification problem. Many classical approaches seem to follow a similar paradigm: train a model on a training set and then use it to predict class labels for new instances.
I am wondering if it is possible to introduce some feedback mechanism into the paradigm. In control theory, introducing a feedback loop is an effective way to improve system performance.
A straightforward approach I have in mind is this: start with an initial set of instances and train a model on them. Then, each time the model makes a wrong prediction, add the misclassified instance to the training set and retrain. This is different from blindly enlarging the training set because it is more targeted. In the language of control theory, this can be seen as a kind of negative feedback. A rough sketch of what I mean is below.
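To make the idea concrete, here is a minimal sketch of the loop, using scikit-learn's `LogisticRegression` on synthetic data; the dataset sizes and model choice are just for illustration, not a claim about the right setup:

```python
# Sketch of the proposed feedback loop: start with a small training set,
# then fold each misclassified instance back in and retrain.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=1000, random_state=0)
train_idx = np.arange(100)          # initial training set (arbitrary size)
stream_idx = np.arange(100, 1000)   # instances that arrive later

model = LogisticRegression().fit(X[train_idx], y[train_idx])

for i in stream_idx:
    pred = model.predict(X[i:i + 1])[0]
    if pred != y[i]:                # wrong prediction: the "negative feedback"
        train_idx = np.append(train_idx, i)
        model.fit(X[train_idx], y[train_idx])  # retrain on the enlarged set
```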
Is there any ongoing research on this kind of feedback approach? Could anyone shed some light?
Reinforcement learning is the established paradigm for exactly this kind of feedback. It is characterized as an interaction between a learner and an environment that provides evaluative feedback: training is based on rewarding desired behaviors and/or punishing undesired ones, and the agent learns to behave by perceiving its environment, taking actions, and observing the results through trial and error. With such a feedback loop, the model keeps learning from new data instead of stagnating, much like a student reviewing their mistakes.
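As a minimal illustration of evaluative feedback, here is an epsilon-greedy two-armed bandit sketch; the reward probabilities are made up, and the point is only that the learner receives a scalar reward rather than the correct answer:

```python
# Epsilon-greedy bandit: learn action values purely from reward feedback.
import random

true_reward_prob = [0.3, 0.7]   # hypothetical; unknown to the agent
estimates = [0.0, 0.0]          # running estimates of each action's value
counts = [0, 0]
epsilon = 0.1

for step in range(1000):
    if random.random() < epsilon:
        action = random.randrange(2)                        # explore
    else:
        action = max(range(2), key=lambda a: estimates[a])  # exploit
    reward = 1 if random.random() < true_reward_prob[action] else 0
    counts[action] += 1
    # incremental mean update of the action-value estimate
    estimates[action] += (reward - estimates[action]) / counts[action]

print(estimates)  # should approach [0.3, 0.7]
```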
There are two areas of research that spring to mind.
The first is Reinforcement Learning. This is an online learning paradigm that allows you to get feedback and update your policy (in this instance, your classifier) as you observe the results.
The second is active learning, where the classifier gets to select examples from a pool of unlabelled examples to be labelled. The key is to have the classifier request labels for the examples that best improve its accuracy, typically the ones that are most difficult under the current classifier hypothesis. A sketch of this strategy follows.
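Here is a rough sketch of the most common active learning strategy, uncertainty sampling, assuming a scikit-learn classifier and synthetic data; the seed size, pool size, and number of rounds are arbitrary:

```python
# Uncertainty sampling: repeatedly label the pooled example the current
# classifier is least confident about, then retrain.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=500, random_state=1)
labeled = list(range(10))        # small labelled seed set
pool = list(range(10, 500))      # unlabelled pool (labels hidden in practice)

model = LogisticRegression()
for _ in range(20):              # 20 labelling rounds
    model.fit(X[labeled], y[labeled])
    probs = model.predict_proba(X[pool])
    # confidence = probability of the predicted class; query the lowest
    uncertain = int(np.argmin(probs.max(axis=1)))
    labeled.append(pool.pop(uncertain))  # ask the "oracle" for its label
```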