Good algorithm for sentiment analysis

Tags:

I tried naive bayes classifier and it's working very bad. SVM works a little better but still horrible. Most of the papers which i read about SVM and naive bayes with some variations(n-gram, POS etc) but all of them gives results close to 50% (authors of articles talk about 80% and high but i cannt to get same accurate on real data).

Is there any more powerfull methods except lexixal analys? SVM and Bayes suppose that words independet. These approach called "bag of words". What if we suppose that words are associated?

For example: Use apriory algorithm to detect that if sentences contains "bad and horrible" then 70% probality that sentence is negative. Also we can use distance between words and so on.

Is it good idea or i'm inventing bicycle?

822

asked Jun 11 '12 14:06

Neir0

2 Answers

You're confusing a couple of concepts here. Neither Naive Bayes nor SVMs are tied to the bag of words approach. Neither SVMs nor the BOW approach have an independence assumption between terms.

Here's some things you can try:

include punctuation marks in your bags of words; esp. ! and ? can be helpful for sentiment analysis, while many feature extractors geared toward document classification throw them away
same for stop words: words like "I" and "my" may be indicative of subjective text
build a two-stage classifier; first determine whether any opinion is expressed, then whether it's positive or negative
try a quadratic kernel SVM instead of a linear one to capture interactions between features.

answered Nov 05 '22 01:11

Fred Foo

Algorithms like SVM, Naive Bayes and maximum entropy ones are supervised machine learning algorithms and the output of your program depends on the training set you have provided. For large scale sentiment analysis I prefer using unsupervised learning method in which one can determine the sentiments of the adjectives by clustering documents into same-oriented parts, and label the clusters positive or negative. More information can be found out from this paper. http://icwsm.org/papers/3--Godbole-Srinivasaiah-Skiena.pdf

Hope this helps you in your work :)

answered Nov 05 '22 00:11

Aravind Asok

Related questions
                            
                                How does one go about reverse engineering an algorithm?
                            
                                Binary tree from in-order and level-order traversals?
                            
                                Generate all strings under length N in C
                            
                                Any ideas about INTERESTING algorithm problems and examples for my students
                            
                                Overlapping Intervals
                            
                                The fastest algorithm determine range overlap
                            
                                Counting the number of substrings
                            
                                Suggestions needed for optimizing O(n^2) algorithm
                            
                                integer nth root
                            
                                Big O Analysis with Recursion Tree of Stooge Sort
                            
                                Clustering 2d integer coordinates into sets of at most N points
                            
                                Tree visualization algorithm
                            
                                Finding even numbers in an array without using feedback
                            
                                How many common English words of 4 letters or more can you make from the letters of a given word (each letter can only be used once)
                            
                                Shopping cart minimization algorithm
                            
                                Is there a way to avoid the linear search on this?
                            
                                A good sorting algorithm for mostly-sorted data that doesn't all fit into memory? [closed]
                            
                                Sorted insertion into small (255 element) list
                            
                                How revision control system restores revision?
                            
                                Find the N-th most frequent number in the array

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Good algorithm for sentiment analysis

Tags:

algorithm

sentiment-analysis

Neir0

People also ask

2 Answers

Fred Foo

Aravind Asok

Recent Activity

Donate For Us