What are advantages of Artificial Neural Networks over Support Vector Machines? [closed]

Tags:

ANN (Artificial Neural Networks) and SVM (Support Vector Machines) are two popular strategies for supervised machine learning and classification. It's not often clear which method is better for a particular project, and I'm certain the answer is always "it depends." Often, a combination of both along with Bayesian classification is used.

These questions on Stackoverflow have already been asked regarding ANN vs SVM:

ANN and SVM classification

what the difference among ANN, SVM and KNN in my classification question

Support Vector Machine or Artificial Neural Network for text processing?

In this question, I'd like to know specifically what aspects of an ANN (specifically, a Multilayer Perceptron) might make it desirable to use over an SVM? The reason I ask is because it's easy to answer the opposite question: Support Vector Machines are often superior to ANNs because they avoid two major weaknesses of ANNs:

(1) ANNs often converge on local minima rather than global minima, meaning that they are essentially "missing the big picture" sometimes (or missing the forest for the trees)

(2) ANNs often overfit if training goes on too long, meaning that for any given pattern, an ANN might start to consider the noise as part of the pattern.

SVMs don't suffer from either of these two problems. However, it's not readily apparent that SVMs are meant to be a total replacement for ANNs. So what specific advantage(s) does an ANN have over an SVM that might make it applicable for certain situations? I've listed specific advantages of an SVM over an ANN, now I'd like to see a list of ANN advantages (if any).

952

asked Jul 24 '12 13:07

Channel72

2 Answers

Judging from the examples you provide, I'm assuming that by ANNs, you mean multilayer feed-forward networks (FF nets for short), such as multilayer perceptrons, because those are in direct competition with SVMs.

One specific benefit that these models have over SVMs is that their size is fixed: they are parametric models, while SVMs are non-parametric. That is, in an ANN you have a bunch of hidden layers with sizes h₁ through h_n depending on the number of features, plus bias parameters, and those make up your model. By contrast, an SVM (at least a kernelized one) consists of a set of support vectors, selected from the training set, with a weight for each. In the worst case, the number of support vectors is exactly the number of training samples (though that mainly occurs with small training sets or in degenerate cases) and in general its model size scales linearly. In natural language processing, SVM classifiers with tens of thousands of support vectors, each having hundreds of thousands of features, is not unheard of.

Also, online training of FF nets is very simple compared to online SVM fitting, and predicting can be quite a bit faster.

EDIT: all of the above pertains to the general case of kernelized SVMs. Linear SVM are a special case in that they are parametric and allow online learning with simple algorithms such as stochastic gradient descent.

134

answered Sep 19 '22 13:09

Fred Foo

One obvious advantage of artificial neural networks over support vector machines is that artificial neural networks may have any number of outputs, while support vector machines have only one. The most direct way to create an n-ary classifier with support vector machines is to create n support vector machines and train each of them one by one. On the other hand, an n-ary classifier with neural networks can be trained in one go. Additionally, the neural network will make more sense because it is one whole, whereas the support vector machines are isolated systems. This is especially useful if the outputs are inter-related.

For example, if the goal was to classify hand-written digits, ten support vector machines would do. Each support vector machine would recognize exactly one digit, and fail to recognize all others. Since each handwritten digit cannot be meant to hold more information than just its class, it makes no sense to try to solve this with an artificial neural network.

However, suppose the goal was to model a person's hormone balance (for several hormones) as a function of easily measured physiological factors such as time since last meal, heart rate, etc ... Since these factors are all inter-related, artificial neural network regression makes more sense than support vector machine regression.

answered Sep 21 '22 13:09

Alan

Related questions
                            
                                Intuitive understanding of 1D, 2D, and 3D convolutions in convolutional neural networks [closed]
                            
                                Why do we have to normalize the input for an artificial neural network? [closed]
                            
                                How can I run Tensorboard on a remote server?
                            
                                Nearest neighbors in high-dimensional data? [closed]
                            
                                How to extract the decision rules from scikit-learn decision-tree?
                            
                                How to initialize weights in PyTorch?
                            
                                How can I one hot encode in Python?
                            
                                Why binary_crossentropy and categorical_crossentropy give different performances for the same problem?
                            
                                Is it possible to specify your own distance function using scikit-learn K-Means Clustering?
                            
                                How to split data into 3 sets (train, validation and test)?
                            
                                Difference between classification and clustering in data mining? [closed]
                            
                                Which machine learning classifier to choose, in general? [closed]
                            
                                Save classifier to disk in scikit-learn
                            
                                Is there a rule-of-thumb for how to divide a dataset into training and validation sets? [closed]
                            
                                How to interpret loss and accuracy for a machine learning model [closed]
                            
                                What is the difference between linear regression and logistic regression? [closed]
                            
                                How to implement the Softmax function in Python
                            
                                What is the difference between supervised learning and unsupervised learning? [closed]
                            
                                Convert array of indices to 1-hot encoded numpy array
                            
                                What is the meaning of the word logits in TensorFlow? [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What are advantages of Artificial Neural Networks over Support Vector Machines? [closed]

Tags:

machine-learning

neural-network

classification

svm

Channel72

People also ask

2 Answers

Fred Foo

Alan

Recent Activity

Donate For Us