How to create a simple 3-layer neural network and teach it using supervised learning?

Tags: python, python-2.7, pybrain

Based on PyBrain's tutorials I managed to knock together the following code:

#!/usr/bin/env python2
# coding: utf-8

from pybrain.structure import FeedForwardNetwork, LinearLayer, SigmoidLayer, FullConnection
from pybrain.datasets import SupervisedDataSet
from pybrain.supervised.trainers import BackpropTrainer

n = FeedForwardNetwork()

inLayer = LinearLayer(2)
hiddenLayer = SigmoidLayer(3)
outLayer = LinearLayer(1)

n.addInputModule(inLayer)
n.addModule(hiddenLayer)
n.addOutputModule(outLayer)

in_to_hidden = FullConnection(inLayer, hiddenLayer)
hidden_to_out = FullConnection(hiddenLayer, outLayer)

n.addConnection(in_to_hidden)
n.addConnection(hidden_to_out)

n.sortModules()

ds = SupervisedDataSet(2, 1)
ds.addSample((0, 0), (0,))
ds.addSample((0, 1), (1,))
ds.addSample((1, 0), (1,))
ds.addSample((1, 1), (0,))

trainer = BackpropTrainer(n, ds)
# trainer.train()
trainer.trainUntilConvergence()

print n.activate([0, 0])[0]
print n.activate([0, 1])[0]
print n.activate([1, 0])[0]
print n.activate([1, 1])[0]

It's supposed to learn the XOR function, but the results seem quite random:

0.208884929522
0.168926515771
0.459452834043
0.424209192223

0.84956138664
0.888512762786
0.564964077401
0.611111147862

asked Sep 18 '15 15:09

Luke

1 Answer

There are four problems with your approach, all easy to identify after reading the Neural Network FAQ:

Why use a bias/threshold?: you should add a bias node. Lack of bias makes the learning very limited: the separating hyperplane represented by the network can only pass through the origin. With the bias node, it can move freely and fit the data better:
```
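from pybrain.structure import BiasUnit

# Add a bias unit and connect it to the hidden layer.
# Do this before calling n.sortModules().
bias = BiasUnit()
n.addModule(bias)

bias_to_hidden = FullConnection(bias, hiddenLayer)
n.addConnection(bias_to_hidden)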
```
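To see why the missing bias hurts this particular network, here is a minimal plain-Python sketch (not PyBrain code; the helper `hidden_activation` and all weight values are made up for illustration). Without a bias, the input (0, 0) drives a sigmoid hidden unit to exactly 0.5 whatever its weights are, so training cannot change how the hidden layer responds to that sample:

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def hidden_activation(inputs, weights, bias=0.0):
    """Output of one sigmoid unit: sigmoid(w . x + bias)."""
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    return sigmoid(total)


# Arbitrary illustrative weights; none of these values come from the question.
for weights in [(0.3, -1.2), (5.0, 5.0), (-7.5, 2.0)]:
    no_bias = hidden_activation((0, 0), weights)
    with_bias = hidden_activation((0, 0), weights, bias=-2.0)
    print(weights, no_bias, with_bias)
```

The `no_bias` column stays at 0.5 for every weight pair, while the bias term shifts the unit's response away from it.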
Why not code binary inputs as 0 and 1?: all your samples lie in a single quadrant of the sample space. Move them to be scattered around the origin:
```
ds = SupervisedDataSet(2, 1)
ds.addSample((-1, -1), (0,))
ds.addSample((-1, 1), (1,))
ds.addSample((1, -1), (1,))
ds.addSample((1, 1), (0,))
```
(Fix the validation code at the end of your script accordingly.)
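The recoding can also be done programmatically. This is a small plain-Python sketch, where `to_bipolar`, `xor_samples` and `recoded` are hypothetical names introduced here, not part of PyBrain:

```python
def to_bipolar(bit):
    """Map a binary value 0/1 to -1/+1."""
    return 2 * bit - 1


xor_samples = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]

# Recode only the inputs; the targets stay 0/1, as in the dataset above.
recoded = [(tuple(to_bipolar(b) for b in inputs), target)
           for inputs, target in xor_samples]

for inputs, target in recoded:
    print(inputs, target)
```

The same recoded inputs are the ones to pass to `n.activate` when checking the trained network, for example `n.activate([-1, -1])` instead of `n.activate([0, 0])`.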
The trainUntilConvergence method holds out part of the dataset for validation and does something that resembles early stopping. This doesn't make sense for such a small dataset. Use trainEpochs instead. 1000 epochs is more than enough for the network to learn this problem:
```
trainer.trainEpochs(1000)
```
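A rough illustration of why a validation split hurts here, in plain Python rather than PyBrain. It assumes that one of the four samples (a quarter of the data) is held out; that proportion and the `split` helper are assumptions for this sketch. Whichever sample is held out, the training set is missing one row of the XOR truth table:

```python
samples = [((-1, -1), 0), ((-1, 1), 1), ((1, -1), 1), ((1, 1), 0)]


def split(data, held_out_index):
    """Hold out one sample for validation and train on the rest."""
    validation = [data[held_out_index]]
    training = data[:held_out_index] + data[held_out_index + 1:]
    return training, validation


for i in range(len(samples)):
    training, validation = split(samples, i)
    held_out_inputs = validation[0][0]
    print("held out", held_out_inputs, "-> training on",
          len(training), "of", len(samples), "patterns")
```

With only three of the four patterns available, the network never sees the complete function it is supposed to learn.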
What learning rate should be used for backprop?: Tune the learning rate parameter. This is something you do every time you employ a neural network. In this case, a value of 0.1 or even 0.2 dramatically increases the learning speed:
```
trainer = BackpropTrainer(n, dataset=ds, learningrate=0.1, verbose=True)
```
(Note the verbose=True parameter. Observing how the error behaves is essential when tuning parameters.)
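As a toy illustration of how the learning rate affects convergence speed, the sketch below runs plain gradient descent on a one-dimensional quadratic. It is not PyBrain's trainer, and the function `descend`, the target 3.0 and the step count are all hypothetical choices. It compares how far each run is from the minimum after the same number of steps:

```python
def descend(learning_rate, steps=50, start=0.0, target=3.0):
    """Minimise f(w) = (w - target)**2 by gradient descent.

    Returns the final distance |w - target|.
    """
    w = start
    for _ in range(steps):
        gradient = 2.0 * (w - target)
        w -= learning_rate * gradient
    return abs(w - target)


for rate in (0.01, 0.1, 0.2):
    print(rate, descend(rate))
```

On this toy problem the larger rates end up much closer to the minimum, but the trend does not continue forever: for this quadratic, any rate above 1.0 makes the iteration diverge, which is why watching the error while tuning matters.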

With these fixes I get consistent and correct results for the given network and dataset, with an error below 1e-23.

answered Sep 20 '22 13:09

BartoszKP
