We need to decide between Support Vector Machines and Fast Artificial Neural Network for some text processing project. It includes Contextual Spelling Correction and then tagging the text to certain phrases and their synonyms. Which will be the right approach? Or is there an alternate to both of these... Something more appropriate than FANN as well as SVM?

I think you'll get a competitive results from both of the algorithms, so you should aggregate the results... think about ensemble learning. Update: I don't know if this is specific enough: use Bayes Optimal Classifier to combine the prediction from each algorithm. You have to train both of your algorithms, then you have to train the Bayes Optimal Classifier to use your algorithms and make optimal predictions based on the input of the algorithms. Separate your training data in 3: <ul> <li>1st data set will be used to train the (Artificial) Neural Network and the Support Vector Machines.</li> <li>2nd data set will be used to train the Bayes Optimal Classifier by taking the raw predictions from the ANN and SVM.</li> <li>3rd data set will be your qualification data set where you will test your trained Bayes Optimal Classifier.</li> </ul> Update 2.0: Another way to create an ensemble of the algorithms is to use 10-fold (or more generally, k-fold) cross-validation: <ul> <li>Break data into 10 sets of size n/10.</li> <li>Train on 9 datasets and test on 1.</li> <li>Repeat 10 times and take a mean accuracy. </li> </ul> Remember that you can generally combine many the classifiers and validation methods in order to produce better results. It's just a matter of finding what works best for your domain.

Support Vector Machine or Artificial Neural Network for text processing? [closed]

2 Answers

I think you'll get a competitive results from both of the algorithms, so you should aggregate the results... think about ensemble learning.

Update:
I don't know if this is specific enough: use Bayes Optimal Classifier to combine the prediction from each algorithm. You have to train both of your algorithms, then you have to train the Bayes Optimal Classifier to use your algorithms and make optimal predictions based on the input of the algorithms.

Separate your training data in 3:

1st data set will be used to train the (Artificial) Neural Network and the Support Vector Machines.
2nd data set will be used to train the Bayes Optimal Classifier by taking the raw predictions from the ANN and SVM.
3rd data set will be your qualification data set where you will test your trained Bayes Optimal Classifier.

Update 2.0:
Another way to create an ensemble of the algorithms is to use 10-fold (or more generally, k-fold) cross-validation:

Break data into 10 sets of size n/10.
Train on 9 datasets and test on 1.
Repeat 10 times and take a mean accuracy.

Remember that you can generally combine many the classifiers and validation methods in order to produce better results. It's just a matter of finding what works best for your domain.

answered Oct 14 '22 03:10

Kiril

You might want to also take a look at maxent classifiers (/log linear models).

They're really popular for NLP problems. Modern implementations, which use quasi-newton methods for optimization rather than the slower iterative scaling algorithms, train more quickly than SVMs. They also seem to be less sensitive to the exact value of the regularization hyperparameter. You should probably only prefer SVMs over maxent, if you'd like to use a kernel to get feature conjunctions for free.

As for SVMs vs. neural networks, using SVMs would probably be better than using ANNs. Like maxent models, training SVMs is a convex optimization problem. This means, given a data set and a particular classifier configuration, SVMs will consistently find the same solution. When training multilayer neural networks, the system can converge to various local minima. So, you'll get better or worse solutions depending on what weights you use to initialize the model. With ANNs, you'll need to perform multiple training runs in order to evaluate how good or bad a given model configuration is.

answered Oct 14 '22 04:10

dmcer

Related questions
                            
                                Can evolutionary computation be a method of reinforcement learning?
                            
                                What are some games with fairly simple heuristics to evaluate positions? [closed]
                            
                                Crossover operation in genetic algorithm for TSP
                            
                                keras error on predict
                            
                                Algorithm: shortest path between all points
                            
                                Fastest way to store a numpy array in redis
                            
                                Incorporating user feedback in a ML model
                            
                                Implementing the TD-Gammon algorithm
                            
                                Continuous vs Discrete artificial neural networks
                            
                                Can ReLU handle a negative input?
                            
                                NLP and Machine learning for sentiment analysis [closed]
                            
                                How to design the artificial intelligence of a fighting game (Street Fighter or Soul Calibur)?
                            
                                Correct formulation of the A* algorithm
                            
                                How to use transposition tables with MTD(f)
                            
                                What is the difference between SOM (Self Organizing Maps) and K-Means?
                            
                                How to Find Documents That are in the same Cluster with KMeans
                            
                                PyTorch Binary Classification - same network structure, 'simpler' data, but worse performance?
                            
                                scaling inputs data to neural network
                            
                                what is meaning of hook that used in tensorflow
                            
                                What is the difference between search and planning

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Support Vector Machine or Artificial Neural Network for text processing? [closed]

Tags:

artificial-intelligence

machine-learning

neural-network

Arc

People also ask

2 Answers

Kiril

dmcer

Recent Activity

Donate For Us