First, I want to say that I'm really new to neural networks and I don't understand them very well ;)
I've made my first C# implementation of a backpropagation neural network. I've tested it on XOR and it seems to work.
Now I would like change my implementation to use resilient backpropagation (Rprop - http://en.wikipedia.org/wiki/Rprop).
The definition says: "Rprop takes into account only the sign of the partial derivative over all patterns (not the magnitude), and acts independently on each weight."
Could somebody tell me what the partial derivative over all patterns is? And how should I compute this partial derivative for a neuron in a hidden layer?
Thanks a lot
UPDATE:
My implementation is based on this Java code: www_.dia.fi.upm.es/~jamartin/downloads/bpnn.java
My backPropagate method looks like this:
public double backPropagate(double[] targets)
{
    double error, change;

    // calculate error terms for output
    double[] output_deltas = new double[outputsNumber];
    for (int k = 0; k < outputsNumber; k++)
    {
        error = targets[k] - activationsOutputs[k];
        output_deltas[k] = Dsigmoid(activationsOutputs[k]) * error;
    }

    // calculate error terms for hidden
    double[] hidden_deltas = new double[hiddenNumber];
    for (int j = 0; j < hiddenNumber; j++)
    {
        error = 0.0;
        for (int k = 0; k < outputsNumber; k++)
        {
            error = error + output_deltas[k] * weightsOutputs[j, k];
        }
        hidden_deltas[j] = Dsigmoid(activationsHidden[j]) * error;
    }

    // update output weights
    for (int j = 0; j < hiddenNumber; j++)
    {
        for (int k = 0; k < outputsNumber; k++)
        {
            change = output_deltas[k] * activationsHidden[j];
            weightsOutputs[j, k] = weightsOutputs[j, k] + learningRate * change + momentumFactor * lastChangeWeightsForMomentumOutpus[j, k];
            lastChangeWeightsForMomentumOutpus[j, k] = change;
        }
    }

    // update input weights
    for (int i = 0; i < inputsNumber; i++)
    {
        for (int j = 0; j < hiddenNumber; j++)
        {
            change = hidden_deltas[j] * activationsInputs[i];
            weightsInputs[i, j] = weightsInputs[i, j] + learningRate * change + momentumFactor * lastChangeWeightsForMomentumInputs[i, j];
            lastChangeWeightsForMomentumInputs[i, j] = change;
        }
    }

    // calculate error
    error = 0.0;
    for (int k = 0; k < outputsNumber; k++)
    {
        error = error + 0.5 * (targets[k] - activationsOutputs[k]) * (targets[k] - activationsOutputs[k]);
    }
    return error;
}
So can I use the change = hidden_deltas[j] * activationsInputs[i]
value as the gradient (partial derivative) for checking the sign?
The Backpropagation Algorithm
Standard backpropagation is a gradient descent algorithm in which the network weights are moved along the negative of the gradient of the performance function. The combination of weights that minimizes the error function is considered a solution to the learning problem.
Rprop has two main advantages over standard backpropagation: first, training with Rprop is often faster than training with backpropagation. Second, Rprop doesn't require you to specify any free parameter values, as opposed to backpropagation, which needs a value for the learning rate (and usually an optional momentum term).
The purpose of the resilient backpropagation (Rprop) training algorithm is to eliminate the harmful effects of the magnitudes of the partial derivatives. Only the sign of the derivative determines the direction of the weight update; the magnitude of the derivative has no effect on the size of the step.
Backpropagation is an algorithm used in machine learning that works by calculating the gradient of the loss function with respect to the weights; moving the weights against that gradient reduces the loss. It relies on the chain rule of calculus to propagate the gradient backward through the layers of a neural network.
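To make the difference between the two update rules concrete, here is a minimal sketch. It assumes a per-weight gradient value (∂E/∂w) has already been computed by backpropagation; the names weight, gradient, learningRate and stepSize are just hypothetical placeholders, not anything from your code:

// Standard backpropagation / gradient descent: the step length depends on
// the magnitude of the partial derivative.
weight = weight - learningRate * gradient;

// Rprop: only the sign of the partial derivative is used; each weight keeps
// its own step size, which is adapted separately over the epochs.
weight = weight - Math.Sign(gradient) * stepSize;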
I think the "over all patterns" simply means "in every iteration"... take a look at the RPROP paper
For the partial derivative: you've already implemented the normal backpropagation algorithm, which is a method for efficiently calculating the gradient. There you calculate the δ values for the individual neurons; multiplied by the corresponding input activations, these give exactly the negative ∂E/∂w values, i.e. the partial derivatives of the global error with respect to the weights.
So instead of multiplying these derivatives by a learning rate, you keep a separate step size per weight and multiply it by one of two constants (η+ or η-), depending on whether the sign of the derivative has changed since the last iteration.
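A rough sketch of how such an Rprop update could look for your input-to-hidden weights is below. It assumes you accumulate your change values (hidden_deltas[j] * activationsInputs[i], i.e. the negative gradient) over all training patterns into gradientInputs, and keep prevGradientInputs and stepSizeInputs arrays between epochs; those array names and the constants are hypothetical placeholders, and the weight-backtracking step from the full Rprop paper is omitted:

// typical Rprop constants from the paper; tune as needed
const double etaPlus = 1.2, etaMinus = 0.5;
const double stepMax = 50.0, stepMin = 1e-6;

for (int i = 0; i < inputsNumber; i++)
{
    for (int j = 0; j < hiddenNumber; j++)
    {
        // gradientInputs[i, j] = sum of hidden_deltas[j] * activationsInputs[i]
        // over all training patterns (the "over all patterns" part)
        double signChange = prevGradientInputs[i, j] * gradientInputs[i, j];

        if (signChange > 0)
        {
            // same sign as last epoch: accelerate
            stepSizeInputs[i, j] = Math.Min(stepSizeInputs[i, j] * etaPlus, stepMax);
        }
        else if (signChange < 0)
        {
            // sign flipped: we jumped over a minimum, so slow down
            stepSizeInputs[i, j] = Math.Max(stepSizeInputs[i, j] * etaMinus, stepMin);
        }

        // move the weight by its own step size; since 'change' is already the
        // negative gradient, adding in the direction of its sign reduces the error
        weightsInputs[i, j] += Math.Sign(gradientInputs[i, j]) * stepSizeInputs[i, j];

        prevGradientInputs[i, j] = gradientInputs[i, j];
    }
}

The same scheme applies to the hidden-to-output weights; only the accumulated gradient array and the weight matrix change.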