How can I apply reinforcement learning to continuous action spaces?

Tags:

I'm trying to get an agent to learn the mouse movements necessary to best perform some task in a reinforcement learning setting (i.e. the reward signal is the only feedback for learning).

I'm hoping to use the Q-learning technique, but while I've found a way to extend this method to continuous state spaces, I can't seem to figure out how to accommodate a problem with a continuous action space.

I could just force all mouse movement to be of a certain magnitude and in only a certain number of different directions, but any reasonable way of making the actions discrete would yield a huge action space. Since standard Q-learning requires the agent to evaluate all possible actions, such an approximation doesn't solve the problem in any practical sense.

711

asked Aug 17 '11 19:08

zergylord

1 Answers

The common way of dealing with this problem is with actor-critic methods. These naturally extend to continuous action spaces. Basic Q-learning could diverge when working with approximations, however, if you still want to use it, you can try combining it with a self-organizing map, as done in "Applications of the self-organising map to reinforcement learning". The paper also contains some further references you might find useful.

121

answered Sep 23 '22 11:09

Don Reba

Related questions
                            
                                Difference between Big-Theta and Big O notation in simple language
                            
                                How can I sort a std::map first by value, then by key?
                            
                                How external merge sort algorithm works?
                            
                                Finding n-th permutation without computing others
                            
                                In Python, what is the fastest algorithm for removing duplicates from a list so that all elements are unique *while preserving order*? [duplicate]
                            
                                How to efficiently calculate a row in pascal's triangle?
                            
                                Elegant Python code for Integer Partitioning [closed]
                            
                                C# Point in polygon
                            
                                What is the benefit for a sort algorithm to be stable?
                            
                                Family Tree Algorithm
                            
                                Why does Dijkstra's algorithm work?
                            
                                Fast Algorithm to Quickly Find the Range a Number Belongs to in a Set of Ranges?
                            
                                Check if a spelled number is in a range in C++
                            
                                Hashing a Tree Structure
                            
                                Rotating an array using Juggling algorithm
                            
                                Create your own MD5 collisions
                            
                                Given a 1 TB data set on disk with around 1 KB per data record, how can I find duplicates using 512 MB RAM and infinite disk space?
                            
                                Calculating which tiles are lit in a tile-based game ("raytracing")
                            
                                Fast n choose k mod p for large n?
                            
                                Rebalancing an arbitrary BST?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I apply reinforcement learning to continuous action spaces?

Tags:

algorithm

machine-learning

reinforcement-learning

q-learning

zergylord

People also ask

1 Answers

Don Reba

Recent Activity

Donate For Us