I'm having trouble seeing what the threshold actually does in a single-layer perceptron. The data is usually separated no matter what the value of the threshold is. It seems a lower threshold divides the data more equally; is this what it is used for?
A threshold transfer function is sometimes used to quantify the output of a neuron in the output layer. Feed-forward networks include Perceptron (linear and non-linear) and Radial Basis Function networks. Feed-forward networks are often used in data mining.
Threshold neural networks are highly useful in engineering applications due to their ease of hardware implementation and low computational complexity. However, such threshold networks have non-differentiable activation functions and therefore cannot be trained by standard gradient-based algorithms.
These per-neuron conditions are called thresholds. For example, suppose the first neuron has a threshold of 100 and receives inputs X1 = 30 and X2 = 0: this neuron will not fire, since the sum 30 + 0 = 30 is not greater than the threshold of 100.
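A minimal sketch of that firing rule in Python (the function name and the threshold value of 100 are just illustrative, taken from the example above):

```python
def fires(x1, x2, threshold=100):
    """The neuron fires only if the input sum exceeds the threshold."""
    return (x1 + x2) > threshold

print(fires(30, 0))   # 30 + 0 = 30 is not greater than 100 -> False
print(fires(70, 50))  # 70 + 50 = 120 is greater than 100 -> True
```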
In fact, the only way a perceptron can “learn” is to adjust its weights and threshold – those are the only things the perceptron can be said to “know”. The standard model of a perceptron can be modified slightly, as illustrated in Figure 14.44, in order to use a fixed-value threshold comparator.
Actually, you only set a threshold explicitly when you aren't using a bias. Otherwise, the threshold is 0.
Remember that a single neuron divides your input space with a hyperplane.
Now imagine a neuron with 2 inputs X = [x1, x2], 2 weights W = [w1, w2], and threshold TH. This equation shows how the neuron works:

x1.w1 + x2.w2 = TH

which is equivalent to:

x1.w1 + x2.w2 - 1.TH = 0

This is your hyperplane equation, which divides the input space.
Notice that this neuron only works if you set the threshold manually. The solution is to turn TH into another weight, so:

x1.w1 + x2.w2 - 1.w0 = 0

where the term 1.w0 is your BIAS. Now you can still draw a hyperplane in your input space without setting a threshold manually (i.e., the threshold is always 0). And if you do set the threshold to another value, the weights will simply adapt to adjust the equation, i.e., the weights (INCLUDING BIAS) absorb the threshold's effect.
The sum of the products of the weights and the inputs is calculated in each node, and if the value is above some threshold (typically 0) the neuron fires and takes the activated value (typically 1); otherwise it takes the deactivated value (typically -1). Neurons with this kind of activation function are also called Artificial neurons or linear threshold units.
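The activation rule described above can be sketched as a small function (a linear threshold unit with the typical values mentioned: threshold 0, activated value 1, deactivated value -1; the function name is just illustrative):

```python
def linear_threshold_unit(inputs, weights, threshold=0.0):
    """Return 1 (fired) if the weighted sum exceeds the threshold, else -1."""
    weighted_sum = sum(x * w for x, w in zip(inputs, weights))
    return 1 if weighted_sum > threshold else -1

print(linear_threshold_unit([1, 1], [0.6, 0.6]))   # sum 1.2 > 0, neuron fires
print(linear_threshold_unit([1, -1], [0.6, 0.6]))  # sum 0.0 is not > 0, deactivated
```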
I think I understand now, with help from Daok. I just wanted to add information for other people to find.
The equation for the separator for a single-layer perceptron is

Σwjxj + bias = threshold

This means that if the input is higher than the threshold, or

Σwjxj + bias > threshold, it gets classified into one category, and if

Σwjxj + bias < threshold, it gets classified into the other.
The bias and the threshold really serve the same purpose, to translate the line (see Role of Bias in Neural Networks). Being on opposite sides of the equation, though, they are "negatively proportional".
For example, if the bias was 0 and the threshold 0.5, this would be equivalent to a bias of -0.5 and a threshold of 0.
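That example can be verified with a quick sketch (the function name is just illustrative): a classifier with bias 0 and threshold 0.5 agrees with one using bias -0.5 and threshold 0 on every input.

```python
def classify(weighted_sum, bias, threshold):
    """Return 1 if the biased sum exceeds the threshold, else 0."""
    return 1 if weighted_sum + bias > threshold else 0

# Moving 0.5 from the threshold side to the bias side (with a sign flip)
# leaves every classification unchanged.
for s in (-1.0, 0.25, 0.5, 0.75, 2.0):
    assert classify(s, bias=0.0, threshold=0.5) == classify(s, bias=-0.5, threshold=0.0)
```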