Training a Neural Network with Reinforcement learning

I know the basics of feedforward neural networks and how to train them using the backpropagation algorithm, but I'm looking for an algorithm that I can use to train an ANN online with reinforcement learning.

For example, the cart-pole swing-up problem is one I'd like to solve with an ANN. In that case, I don't know what should be done to control the pendulum; I only know how close I am to the ideal position. I need the ANN to learn based on reward and punishment, so supervised learning isn't an option.

Another situation is something like the snake game, where feedback is delayed and limited to goals and anti-goals, rather than a continuous reward signal.

I can think of some algorithms for the first situation, like hill-climbing or genetic algorithms, but I'm guessing they would both be slow. They might also be applicable in the second scenario, but they would be incredibly slow and not conducive to online learning.
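To make the hill-climbing idea concrete, here is a minimal sketch. Everything in it is an assumption for illustration: the toy task (pushing a point mass toward position 0, not cart-pole), the 2-3-1 network, and the names `policy`, `episode_reward` and `hill_climb`. It perturbs all weights at random and keeps the change only if the total reward improves.

```python
import math
import random


def policy(weights, state):
    """Tiny feedforward net: 2 inputs -> 3 tanh hidden units -> 1 output."""
    w_hidden, w_out = weights[:9], weights[9:]
    hidden = []
    for h in range(3):
        w0, w1, bias = w_hidden[3 * h:3 * h + 3]
        hidden.append(math.tanh(w0 * state[0] + w1 * state[1] + bias))
    return sum(w * h for w, h in zip(w_out[:3], hidden)) + w_out[3]


def episode_reward(weights, steps=50):
    """Hypothetical task: reward is minus the squared distance from 0."""
    total = 0.0
    for start in (-1.0, 0.5, 1.0):
        pos, vel = start, 0.0
        for _ in range(steps):
            # The network output is used as a force, clipped to [-1, 1].
            force = max(-1.0, min(1.0, policy(weights, (pos, vel))))
            vel += 0.1 * force
            pos += 0.1 * vel
            total -= pos * pos
    return total


def hill_climb(iterations=300, sigma=0.1, seed=0):
    rng = random.Random(seed)
    best = [rng.gauss(0.0, 0.5) for _ in range(13)]
    best_reward = episode_reward(best)
    history = [best_reward]
    for _ in range(iterations):
        # Perturb every weight, then keep the candidate only if it scores better.
        candidate = [w + rng.gauss(0.0, sigma) for w in best]
        reward = episode_reward(candidate)
        if reward > best_reward:
            best, best_reward = candidate, reward
        history.append(best_reward)
    return best, history
```

Each candidate needs full rollouts before it can be accepted or rejected, which is why I'd expect this to be slow and hard to use online.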

My question is simple: Is there a simple algorithm for training an artificial neural network with reinforcement learning? I'm mainly interested in real-time reward situations, but if an algorithm for goal-based situations is available, even better.

638

asked May 23 '12 14:05

Kendall Frey

1 Answer

There are some research papers on the topic:

Efficient Reinforcement Learning Through Evolving Neural Network Topologies (2002)
Reinforcement Learning Using Neural Networks, with Applications to Motor Control
Reinforcement Learning Neural Network To The Problem Of Autonomous Mobile Robot Obstacle Avoidance

And some code:

Code examples for neural network reinforcement learning.

Those are just some of the top Google search results on the topic. The first couple of papers look pretty good, although I haven't read them myself. You'll likely find more on neural networks with reinforcement learning with a quick search on Google Scholar.
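As a rough illustration of what such code can look like, here is a minimal sketch of online Q-learning with a small network as the function approximator. It is not taken from the links above. The idea is that the temporal-difference target `reward + gamma * max Q(next_state)` plays the role of the training target, so each step is an ordinary backpropagation update. The corridor environment (a reward of 1 only on reaching the last cell, so the feedback is delayed and goal-based) and the names `QNetwork` and `train` are assumptions made up for this example.

```python
import math
import random


class QNetwork:
    """One-hidden-layer net mapping a state vector to one Q-value per action."""

    def __init__(self, n_in, n_hidden, n_actions, rng):
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
                   for _ in range(n_actions)]
        self.b2 = [0.0] * n_actions

    def forward(self, x):
        h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
             for row, b in zip(self.w1, self.b1)]
        q = [sum(w * hi for w, hi in zip(row, h)) + b
             for row, b in zip(self.w2, self.b2)]
        return h, q

    def update(self, x, action, target, lr):
        """One backprop step on 0.5 * (target - Q(x, action)) ** 2."""
        h, q = self.forward(x)
        err = target - q[action]
        for j, hj in enumerate(h):
            # Hidden-layer gradient uses the output weight before it changes.
            grad_h = err * self.w2[action][j] * (1.0 - hj * hj)
            self.w2[action][j] += lr * err * hj
            for i, xi in enumerate(x):
                self.w1[j][i] += lr * grad_h * xi
            self.b1[j] += lr * grad_h
        self.b2[action] += lr * err
        return err


def train(episodes=200, n_cells=5, gamma=0.9, lr=0.1, epsilon=0.3, seed=0):
    """Hypothetical corridor: start in cell 0, goal is the last cell."""
    rng = random.Random(seed)
    net = QNetwork(n_cells, 8, 2, rng)

    def one_hot(s):
        return [1.0 if i == s else 0.0 for i in range(n_cells)]

    for _ in range(episodes):
        state = 0
        for _ in range(200):  # step cap so an episode always ends
            _, q = net.forward(one_hot(state))
            if rng.random() < epsilon:
                action = rng.randrange(2)          # explore
            else:
                action = 0 if q[0] > q[1] else 1   # exploit
            # Action 0 moves left (stopping at the wall), action 1 moves right.
            next_state = max(0, state - 1) if action == 0 else state + 1
            done = next_state == n_cells - 1
            reward = 1.0 if done else 0.0
            if done:
                target = reward
            else:
                _, q_next = net.forward(one_hot(next_state))
                target = reward + gamma * max(q_next)
            net.update(one_hot(state), action, target, lr)  # learn every step
            state = next_state
            if done:
                break
    return net
```

The weights change after every single step, so this is online in the sense asked about. The hyperparameters are untuned guesses, and nothing here guarantees convergence; treat it as a starting point, not a finished solution.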

151

answered Sep 19 '22 04:09

Kiril


Tags: language-agnostic, algorithm, machine-learning, neural-network, reinforcement-learning
