Learning rate of a Q learning agent

Tags:

The question how the learning rate influences the convergence rate and convergence itself. If the learning rate is constant, will Q function converge to the optimal on or learning rate should necessarily decay to guarantee convergence?

840

asked Oct 08 '15 09:10

uduck

1 Answers

Learning rate tells the magnitude of step that is taken towards the solution.

It should not be too big a number as it may continuously oscillate around the minima and it should not be too small of a number else it will take a lot of time and iterations to reach the minima.

The reason why decay is advised in learning rate is because initially when we are at a totally random point in solution space we need to take big leaps towards the solution and later when we come close to it, we make small jumps and hence small improvements to finally reach the minima.

Analogy can be made as: in the game of golf when the ball is far away from the hole, the player hits it very hard to get as close as possible to the hole. Later when he reaches the flagged area, he choses a different stick to get accurate short shot.

So its not that he won't be able to put the ball in the hole without choosing the short shot stick, he may send the ball ahead of the target two or three times. But it would be best if he plays optimally and uses the right amount of power to reach the hole. Same is for decayed learning rate.

answered Sep 28 '22 08:09

VishalTheBeast

Related questions
                            
                                WEKA: How to filter multiple attribute ranges?
                            
                                Discovering "templates" in a given text?
                            
                                Histogram approximation for streaming data
                            
                                Basic understanding of the Adaboost algorithm
                            
                                What are the advantages or disadvantages of having multiple output nodes compared to a few within a neural network
                            
                                Implementations of local regression and local likelihood methods
                            
                                Implementing Support Vector Machine - EFFICIENTLY computing gram-matrix K
                            
                                How to train image (pixel) data in libsvm format to use for recognition with Java
                            
                                scikit learn clf.fit / score model accuracy
                            
                                SVM - relation between the number of training samples and the number of features
                            
                                Rescaling after feature scaling, linear regression
                            
                                Binning of continuous variables in sklearn ensemble and trees
                            
                                Gaussian-RBM fails on a trivial example
                            
                                which is best svm example which classifies plain input text?
                            
                                Vowpal Wabbit training and testing data formats
                            
                                Cannot connect PlainText (JSON) to Dataset at Azure Machine Learning
                            
                                Doing hyperparameter estimation for the estimator in each fold of Recursive Feature Elimination
                            
                                Using sklearn cross_val_score and kfolds to fit and help predict model
                            
                                Want to know the diff among pd.factorize, pd.get_dummies, sklearn.preprocessing.LableEncoder and OneHotEncoder [closed]
                            
                                How to map features from the output of a VectorAssembler back to the column names in Spark ML?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Learning rate of a Q learning agent

Tags:

machine-learning

reinforcement-learning

q-learning

uduck

People also ask

1 Answers

VishalTheBeast

Recent Activity

Donate For Us