Word sense disambiguation in NLTK Python

I am new to NLTK in Python and am looking for a sample application that does word sense disambiguation. My searches turned up plenty of algorithms but no sample application. I just want to pass in a sentence and get the sense of each word by looking it up in the WordNet library. Thanks.

I have found a similar module in Perl (http://marimba.d.umn.edu/allwords/allwords.html). Is there such a module in NLTK for Python?

asked Sep 13 '10 11:09

thesensemakers

1 Answer

Recently, part of the pywsd code has been ported into the bleeding-edge version of NLTK, in the wsd.py module. Try:

>>> from nltk.wsd import lesk
>>> sent = 'I went to the bank to deposit my money'
>>> ambiguous = 'bank'
>>> lesk(sent, ambiguous)
Synset('bank.v.04')
>>> lesk(sent, ambiguous).definition()
u'act as the banker in a game or in gambling'

For better WSD performance, use the pywsd library instead of the NLTK module. In general, simple_lesk() from pywsd does better than lesk from NLTK. I'll try to update the NLTK module as much as possible when I'm free.

In response to Chris Spencer's comment, please note the limitations of Lesk algorithms. I'm simply providing an accurate implementation of the algorithms; it's not a silver bullet. See http://en.wikipedia.org/wiki/Lesk_algorithm
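To make that limitation concrete, here is a minimal sketch of the gloss-overlap idea behind simplified Lesk. It is a toy, not the NLTK or pywsd implementation: the sense labels, the glosses in TOY_SENSES, the stopword list, and the helper names content_words and toy_lesk are all made up for illustration and do not come from WordNet.

```python
# Toy illustration of the simplified Lesk idea: pick the sense whose gloss
# shares the most words with the sentence. The inventory below is invented
# for this example; it is NOT WordNet and NOT NLTK's or pywsd's code.

STOPWORDS = {"a", "an", "the", "to", "of", "in", "or", "and",
             "my", "i", "as", "that", "for"}

# Hypothetical mini sense inventory: word -> {sense label: gloss}
TOY_SENSES = {
    "bank": {
        "bank.financial": "an institution that accepts deposits of money and lends it",
        "bank.river": "sloping land beside a river or lake",
    },
}


def content_words(text):
    """Lowercase, split on whitespace, strip punctuation, drop stopwords."""
    words = (w.strip(".,;:!?").lower() for w in text.split())
    return {w for w in words if w and w not in STOPWORDS}


def toy_lesk(sentence, word, senses=TOY_SENSES):
    """Return (best_sense, overlap_size) using exact word overlap only."""
    context = content_words(sentence) - {word}
    best, best_overlap = None, -1
    for label, gloss in senses[word].items():
        overlap = len(context & content_words(gloss))
        if overlap > best_overlap:  # ties keep the earlier sense
            best, best_overlap = label, overlap
    return best, best_overlap


# One shared word ("money"); "deposit" does not match "deposits".
print(toy_lesk("I went to the bank to deposit my money", "bank"))
# ('bank.financial', 1)

# No shared words at all, so the first listed sense wins by default.
print(toy_lesk("We sat on the bank and fished", "bank"))
# ('bank.financial', 0)
```

The second call shows the weakness: when the context and the glosses share no exact words, the choice is effectively arbitrary, which is why short sentences and short glosses often produce the wrong sense.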

Also please note that, although:

lesk("My cat likes to eat mice.", "cat", "n")

doesn't give you the right answer, you can use the pywsd implementation of max_similarity():

>>> from pywsd.similarity import max_similarity
>>> max_similarity('my cat likes to eat mice', 'cat', 'wup', pos='n').definition()
'feline mammal usually having thick soft fur and no ability to roar: domestic cats; wildcats'
>>> max_similarity('my cat likes to eat mice', 'cat', 'lin', pos='n').definition()
'feline mammal usually having thick soft fur and no ability to roar: domestic cats; wildcats'

@Chris, if you want a Python setup.py, just ask politely and I'll write it...

answered Oct 13 '22 01:10

alvas
