Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

sentiment analysis - wordNet , sentiWordNet lexicon

I need a list of positive and negative words with the weights assigned to words according to how strong and week they are. I have got :

1.) WordNet - It gives a + or - score for every word.

2.) SentiWordNet - Giving positive and negative values in the range [0,1].

I checked these on few words,

love - wordNet is giving 0.0 for both noun and verb, I dont know why i think it should be positive by at least some factor.

repress - wordNet gives -9.93
- SentiWordNet gives - 0.0 for both pos and neg. (should be negative)

repose - wordNet - 2.488
- SentiWordNet - { pos - 0.125, neg - 0.5 } (should be positive)

I need some help to decide which one to use.

Thanks.

like image 600
Naveen Avatar asked Oct 13 '12 02:10

Naveen


People also ask

What is WordNet in sentiment analysis?

1.) WordNet - It gives a + or - score for every word. 2.) SentiWordNet - Giving positive and negative values in the range [0,1].

What is lexicon in sentiment analysis?

One of the approaches or techniques of semantic analysis is the lexicon-based approach. This technique calculates the sentiment orientations of the whole document or set of sentence(s) from semantic orientation of lexicons. Semantic orientation can be positive, negative, or neutral.

What is Afinn lexicon?

The AFINN lexicon is a list of English terms manually rated for valence with an integer between -5 (negative) and +5 (positive) by Finn Årup Nielsen between 2009 and 2011. The original lexicon contains some multi-word phrases, but they are excluded here.


2 Answers

Quite often the degree and/or polarity may depend on the domain and/or the context, so the word alone isn't really enough to make a decision.

If you have some annotated data, I suggest training a classifier on that using the scores provided by the two resources as features. If you don't, one option is to use one of the available sentiment-annotated corpora that matches the domain in question. Without any data at all the whole task becomes somewhat tricky, although there is a substantial body of work on unsupervised approaches to sentiment classification, I believe, see, e.g. Unsupervised Sentiment Analysis

like image 181
Qnan Avatar answered Sep 28 '22 19:09

Qnan


There is an interface to give different opinions for SentiWordNet, if you think they are wrong: http://sentiwordnet.isti.cnr.it/search.php?q=repose

I downloaded latest Wordnet 3.1, and checked the file format documentation, and don't see any mention of the sentiment numbers you mention. It is also not shown in the online search.

So, for both those reasons I'd suggest going with SentiWordNet!

(I see your question is a year old, so perhaps you can tell us what you did go with, and why?)

like image 25
Darren Cook Avatar answered Sep 28 '22 18:09

Darren Cook