I don't quite understand why a sigmoid function is seen as more useful (for neural networks) than a step function... hoping someone can explain this for me. Thanks in advance.
Sigmoid function: a general mathematical function with an S-shaped (sigmoid) curve that is bounded, differentiable, and real-valued. Logistic function: a particular sigmoid function that is widely used in binary classification problems, e.g. in logistic regression.
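For concreteness, the logistic function usually meant in this context is

$$\sigma(x) = \frac{1}{1 + e^{-x}},$$

which is bounded in $(0, 1)$, differentiable everywhere, and defined for all real $x$, matching the sigmoid properties above.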
The main reason we use the sigmoid function is that its output lies between 0 and 1. It is therefore especially used for models where we have to predict a probability as the output: since probabilities only exist in the range 0 to 1, the sigmoid is the right choice.
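As a minimal sketch of this probability interpretation (the logit values below are made-up example data, not from any particular model):

```python
import numpy as np

def sigmoid(z):
    # Squash raw model outputs (logits) into the open interval (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

logits = np.array([-2.0, 0.0, 3.0])   # hypothetical raw outputs
probs = sigmoid(logits)               # now interpretable as probabilities
labels = (probs >= 0.5).astype(int)   # threshold at 0.5 for class labels
print(probs)    # [0.119 0.5   0.953] (rounded)
print(labels)   # [0 1 1]
```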
The step function is one of the simplest kinds of activation function. We choose a threshold value, and if the net input, say y, is greater than the threshold, the neuron is activated; otherwise it is not.
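A minimal sketch of such a threshold activation (the function name `step` and the default threshold of 0 are illustrative choices):

```python
import numpy as np

def step(y, threshold=0.0):
    # Heaviside step activation: the neuron fires (outputs 1) only
    # when the net input y exceeds the threshold; otherwise it outputs 0.
    return np.where(y > threshold, 1.0, 0.0)

net_input = np.array([-0.3, 0.1, 2.5])
print(step(net_input))  # [0. 1. 1.]
```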
The output shifts abruptly from 0 to 1, which may not fit the data well. The function is not differentiable at the threshold (and its derivative is 0 everywhere else), so gradient-based training is impossible.
Disadvantages of the sigmoid function: it is particularly prone to the vanishing gradient problem, and its output is not zero-centred.
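To see where the vanishing gradient comes from: the sigmoid's derivative is $\sigma(x)(1 - \sigma(x))$, which never exceeds 0.25, so gradients shrink geometrically as they pass backwards through stacked sigmoid layers. A small sketch (the layer count of 10 is arbitrary):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_grad(x):
    # Derivative of the sigmoid: sigma(x) * (1 - sigma(x)), at most 0.25
    s = sigmoid(x)
    return s * (1.0 - s)

print(sigmoid_grad(0.0))  # 0.25, the maximum possible value
print(0.25 ** 10)         # ~9.5e-07: upper bound on the gradient factor
                          # contributed by 10 stacked sigmoid layers
```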
The (Heaviside) step function is typically only useful within single-layer perceptrons, an early type of neural network that can be used for classification when the input data is linearly separable.
However, multi-layer neural networks (multi-layer perceptrons) are of more interest because they are general function approximators and can distinguish data that is not linearly separable.
Multi-layer perceptrons are trained using backpropagation. A requirement for backpropagation is a differentiable activation function. That's because backpropagation uses gradient descent to update the network weights, and computing those gradients requires differentiating the activation function.
The Heaviside step function is non-differentiable at x = 0 and its derivative is 0 elsewhere. This means gradient descent won't be able to make progress in updating the weights and backpropagation will fail.
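A numerical sketch of this failure (a toy single-weight network with squared-error loss; all values here are made up for illustration): the gradient-descent update is proportional to the activation's derivative, so with a sigmoid the weight moves, while with the step function the derivative term, and hence the update, is zero almost everywhere.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# One gradient-descent step for a single weight w, input x, target t,
# with loss = 0.5 * (out - t)**2 and a sigmoid activation.
w, x, t, lr = 0.5, 1.0, 1.0, 0.1
out = sigmoid(w * x)
grad = (out - t) * out * (1.0 - out) * x  # chain rule; the out*(1-out)
                                          # factor is the sigmoid derivative
w -= lr * grad
print(w)  # the weight moved; replacing the derivative factor with the
          # step function's derivative (0 almost everywhere) gives grad = 0
```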
The sigmoid or logistic function does not have this shortcoming and this explains its usefulness as an activation function within the field of neural networks.
It depends on the problem you are dealing with. In case of simple binary classification, a step function is appropriate. Sigmoids can be useful when building more biologically realistic networks by introducing noise or uncertainty. Another, but completely different, use of sigmoids is for numerical continuation, i.e. when doing bifurcation analysis with respect to some parameter in the model. Numerical continuation is easier with smooth systems (and very tricky with non-smooth ones).