I am working on backpropagation for a neural network that uses ReLU. In a previous project of mine I did it for a network that used the sigmoid activation function, but now I'm a little confused, since ReLU doesn't seem to have a derivative.
Here's an image of how weight5 contributes to the total error. In this example, ∂out/∂net = a*(1 - a) if I use the sigmoid function.
What should I write instead of "a*(1 - a)" to make the backpropagation work?
Generally: a ReLU is a unit that uses the rectifier activation function. That means it works exactly like any other hidden unit, except that instead of tanh(x), sigmoid(x), or whatever activation you would otherwise use, you use f(x) = max(0, x).
ReLU is differentiable at every point except 0: the left derivative at z = 0 is 0 and the right derivative is 1.
The derivative (df(e)/de) is used by the optimization technique to locate the minima of the loss function. A large value of the derivative results in a large adjustment of the corresponding weight.
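As a concrete illustration (a minimal sketch, not taken from the question's network), the activation and its derivative could be written like this, with the value at exactly 0 set to 0 by convention:

    import numpy as np

    def relu(x):
        # f(x) = max(0, x), applied elementwise
        return np.maximum(0.0, x)

    def relu_derivative(x):
        # f'(x) = 1 for x > 0 and 0 for x < 0; at x = 0 we pick 0 by convention
        return (np.asarray(x) > 0).astype(float)

    print(relu(np.array([-2.0, 0.0, 3.0])))             # [0. 0. 3.]
    print(relu_derivative(np.array([-2.0, 0.0, 3.0])))  # [0. 0. 1.]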
The ReLU function is not differentiable at the origin, so according to my understanding the backpropagation algorithm (BPA) is not suitable for training a neural network with ReLUs, since the chain rule of multivariable calculus applies only to smooth functions.
"since ReLU doesn't have a derivative."
No, ReLU does have a derivative (everywhere except at 0). I assume you are using the ReLU function f(x) = max(0, x). That means f(x) = 0 if x <= 0, else f(x) = x. In the first case, when x < 0, the derivative of f(x) with respect to x is f'(x) = 0. In the second case, it is clear that f'(x) = 1.
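So, to answer the original question: in place of a*(1 - a) you use 1 when the unit's input (net) is positive and 0 otherwise. A minimal sketch of that substitution, with made-up numbers just to show where the factor goes in the chain rule for weight5:

    # All values below are illustrative, not taken from the question's network.
    dE_dout  = 0.74   # how the total error changes with this unit's output
    net      = 0.35   # the unit's pre-activation input (weighted sum)
    dnet_dw5 = 0.59   # output of the unit that weight5 connects from

    # Sigmoid version:  dout/dnet = a * (1 - a)
    # ReLU version:     dout/dnet = 1 if net > 0 else 0
    dout_dnet = 1.0 if net > 0 else 0.0

    # Chain rule: dE/dw5 = dE/dout * dout/dnet * dnet/dw5
    dE_dw5 = dE_dout * dout_dnet * dnet_dw5
    print(dE_dw5)  # gradient used to update weight5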