I'm creating a neural network using the backpropagation technique for learning.
I understand we need to find the derivative of the activation function used. I'm using the standard sigmoid function
f(x) = 1 / (1 + e^(-x))
and I've seen that its derivative is
dy/dx = f'(x) = f(x) * (1 - f(x))
This may be a daft question, but does this mean we have to pass x through the sigmoid function twice when evaluating the derivative, so it would expand to
dy/dx = f'(x) = 1 / (1 + e^(-x)) * (1 - (1 / (1 + e^(-x))))
or can we simply take the already-calculated output of f(x), which is the output of the neuron, and substitute that value for f(x)?
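To convince yourself that the two forms give the same number, here is a short Python check; the input x = 0.5 is just an arbitrary example value:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

x = 0.5
s = sigmoid(x)                                     # cached neuron output
deriv_cached = s * (1.0 - s)                       # reuse the cached value
deriv_expanded = sigmoid(x) * (1.0 - sigmoid(x))   # fully expanded form
assert abs(deriv_cached - deriv_expanded) < 1e-12  # identical results
```

In practice you use the cached form: the forward pass has already computed the neuron's output, so the backward pass gets the derivative with one multiply and one subtract instead of recomputing the exponential.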
The sigmoid function is defined as a strictly increasing and continuously differentiable function.
The derivative of the sigmoid function is the sigmoid function times one minus itself.
As we can see on the left side of the plot, where x = -10, the sigmoid changes very little as x changes, which is why the slope (derivative) of the sigmoid is nearly 0 there. Near the center of the plot, however, a small change in x produces a large change in sigmoid(x).
The sigmoid function is also called a squashing function: its domain is the set of all real numbers, and its range is (0, 1). Hence, whether the input is a very large negative number or a very large positive number, the output always lies between 0 and 1.
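The saturation behaviour described above is easy to check numerically. This sketch evaluates the derivative in a tail (x = -10) and at the centre (x = 0), where it reaches its maximum of 0.25:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def sigmoid_prime(x):
    s = sigmoid(x)
    return s * (1.0 - s)

# nearly flat in the saturated tail, steepest at the centre
print(sigmoid_prime(-10.0))   # ~4.5e-05: almost no gradient
print(sigmoid_prime(0.0))     # 0.25: the maximum slope
```

The vanishing slope in the tails is exactly why deep sigmoid networks can suffer from vanishing gradients during backpropagation.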
Dougal is correct. Just do

f = 1 / (1 + exp(-x))
df = f * (1 - f)
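To show how that cached value is reused during learning, here is a minimal single-neuron gradient step with a squared-error loss; the weight, learning rate, input, and target are arbitrary illustrative values, not part of the original answer:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

w, lr = 0.5, 0.1          # weight and learning rate (arbitrary values)
x, target = 1.0, 0.8      # one hypothetical training sample

out = sigmoid(w * x)                            # forward pass: cache the output
# chain rule: d(loss)/dw = (out - target) * out * (1 - out) * x
grad = (out - target) * out * (1.0 - out) * x   # derivative reuses `out`
w -= lr * grad                                  # gradient-descent update
```

Note that the derivative term `out * (1.0 - out)` never calls `sigmoid` again; it is computed entirely from the output saved during the forward pass.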