 

SVM and Neural Network

What is the difference between an SVM and a neural network? Is it true that a linear SVM is the same as a NN, and that for non-linearly separable problems a NN handles this by adding hidden layers while an SVM handles it by changing the space dimensions?

CoyBit asked Jan 22 '12



2 Answers

There are two parts to this question. The first part is "what is the form of the function learned by these methods?" For NNs and SVMs this is typically the same. For example, a single-hidden-layer neural network uses exactly the same form of model as an SVM. That is:

Given an input vector x, the output is: output(x) = sum_over_all_i weight_i * nonlinear_function_i(x)

Generally the nonlinear functions will also have some parameters. So these methods need to learn how many nonlinear functions should be used, what their parameters are, and what the values of all the weight_i coefficients should be.
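
For concreteness, here is a minimal NumPy sketch of that shared form. All the support vectors, hidden-layer parameters, and weights below are made-up illustration values, not anything learned from data: for a kernel SVM the nonlinear functions are kernel evaluations against support vectors, and for a one-hidden-layer NN they are the hidden-unit activations.

    import numpy as np

    # Both models compute output(x) = sum_i weight_i * nonlinear_function_i(x);
    # only the choice of nonlinear_function_i differs.

    def svm_style_output(x, support_vectors, weights, gamma=1.0):
        # Kernel SVM: nonlinear_function_i(x) = K(x, support_vector_i), here an RBF kernel.
        phi = np.exp(-gamma * np.sum((support_vectors - x) ** 2, axis=1))
        return weights @ phi

    def nn_style_output(x, hidden_w, hidden_b, weights):
        # One-hidden-layer NN: nonlinear_function_i(x) = tanh(v_i . x + b_i).
        phi = np.tanh(hidden_w @ x + hidden_b)
        return weights @ phi

    x = np.array([0.5, -1.0])
    weights = np.array([0.3, -0.7])
    print(svm_style_output(x, np.array([[0.0, 0.0], [1.0, -1.0]]), weights))
    print(nn_style_output(x, np.array([[1.0, 0.5], [-0.2, 0.8]]), np.array([0.1, -0.1]), weights))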

Therefore, the difference between an SVM and a NN is in how they decide what these parameters should be set to. Usually when someone says they are using a neural network, they mean they are trying to find the parameters that minimize the mean squared prediction error over a set of training examples, and they will almost always be using the stochastic gradient descent optimization algorithm to do this. SVMs, on the other hand, try to minimize both the training error and some measure of "hypothesis complexity", so they find a set of parameters that fits the data but is also "simple" in some sense. You can think of it as Occam's razor for machine learning. The most common optimization algorithm used with SVMs is sequential minimal optimization.
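
As a rough illustration of those two training styles (assuming scikit-learn is available), the snippet below fits both kinds of model on a toy dataset. One caveat: MLPClassifier minimizes log-loss rather than squared error for classification, but the optimizer contrast described above still holds, with stochastic gradient descent for the NN and an SMO-type solver (libsvm) inside SVC.

    from sklearn.datasets import make_classification
    from sklearn.neural_network import MLPClassifier
    from sklearn.svm import SVC

    X, y = make_classification(n_samples=200, n_features=5, random_state=0)

    # NN: parameters found by stochastic gradient descent on a prediction-error loss.
    nn = MLPClassifier(hidden_layer_sizes=(10,), solver="sgd", max_iter=2000,
                       random_state=0).fit(X, y)

    # SVM: training error plus a "simplicity" (margin) penalty, controlled by C,
    # solved with sequential minimal optimization under the hood.
    svm = SVC(kernel="rbf", C=1.0).fit(X, y)

    print("NN  training accuracy:", nn.score(X, y))
    print("SVM training accuracy:", svm.score(X, y))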

Another big difference between the two methods is that stochastic gradient descent, as NN implementations typically use it, isn't guaranteed to find the optimal set of parameters, whereas any decent SVM implementation will find the optimal set. People like to say that neural networks can get stuck in a local minimum while SVMs don't.
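
A quick way to see this claim in practice (again assuming scikit-learn): train the same small network from a few different random initializations and compare the final training losses, then note that SVC needs no such initialization because its optimization problem is convex.

    from sklearn.datasets import make_classification
    from sklearn.neural_network import MLPClassifier
    from sklearn.svm import SVC

    X, y = make_classification(n_samples=300, n_features=10, random_state=1)

    # Same architecture, different random initializations: the final training loss
    # can differ because the NN objective is non-convex.
    for seed in (0, 1, 2):
        nn = MLPClassifier(hidden_layer_sizes=(20,), max_iter=500,
                           random_state=seed).fit(X, y)
        print(f"NN final training loss, seed {seed}: {nn.loss_:.4f}")

    # SVC takes no random initialization for the optimization itself; the convex
    # problem has a single optimum, so repeated fits reach the same solution.
    svm = SVC(kernel="rbf").fit(X, y)
    print("SVM training accuracy:", svm.score(X, y))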

Davis King answered Sep 22 '22


NNs are heuristic, while SVMs are theoretically founded. An SVM is guaranteed to converge towards the best solution in the PAC (probably approximately correct) sense. For example, for two linearly separable classes an SVM will draw the separating hyperplane directly halfway between the nearest points of the two classes (these become the support vectors). A neural network would draw any line that separates the samples, which is correct for the training set but might not have the best generalization properties.

So no, even for linearly separable problems NNs and SVMs are not the same.
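
A small sketch of the maximum-margin behavior described above, assuming scikit-learn is available: on a hand-made linearly separable set, a (nearly) hard-margin linear SVC reports the nearest points of the two classes as its support vectors, and its decision function is roughly -1/+1 exactly at those points, i.e. the boundary sits halfway between them.

    import numpy as np
    from sklearn.svm import SVC

    # Two well-separated clusters; with a large C the soft margin is effectively hard.
    X = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0],    # class 0
                  [3.0, 3.0], [4.0, 3.0], [3.0, 4.0]])   # class 1
    y = np.array([0, 0, 0, 1, 1, 1])

    svm = SVC(kernel="linear", C=1e6).fit(X, y)
    print("support vectors:\n", svm.support_vectors_)
    # At the support vectors the decision function is roughly -1 / +1,
    # so the separating hyperplane lies halfway between them.
    print(svm.decision_function([[1.0, 0.0], [0.0, 1.0], [3.0, 3.0]]))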

In the case of linearly non-separable classes, both SVMs and NNs apply a non-linear projection into a higher-dimensional space. In the case of NNs this is achieved by introducing additional neurons in the hidden layer(s). For SVMs, a kernel function is used to the same effect. A neat property of the kernel function is that the computational complexity doesn't rise with the dimensionality of the (implicit) feature space, while for NNs it obviously rises with the number of neurons.
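
To see why the kernel side of this is cheap, here is a minimal sketch: one RBF kernel evaluation corresponds to an inner product in a very high-dimensional (implicit) feature space, yet the work done stays O(d) in the input dimension d.

    import numpy as np

    def rbf_kernel(x, z, gamma=1.0):
        # Equivalent to an inner product after an (implicitly infinite-dimensional)
        # feature mapping, but computed with only O(d) operations in input space.
        return np.exp(-gamma * np.sum((x - z) ** 2))

    d = 10_000
    x, z = np.random.randn(d), np.random.randn(d)
    print(rbf_kernel(x, z, gamma=1.0 / d))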

Igor F. answered Sep 21 '22