Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to get the WordNet synset given an offset ID?

Tags:

I have a WordNet synset offset (for example id="n#05576222"). Given this offset, how can I get the synset using Python?

like image 894
user1039457 Avatar asked Nov 10 '11 09:11

user1039457


People also ask

What is a WordNet Synset?

WordNet is the lexical database i.e. dictionary for the English language, specifically designed for natural language processing. Synset is a special kind of a simple interface that is present in NLTK to look up words in WordNet. Synset instances are the groupings of synonymous words that express the same concept.


1 Answers

As of NLTK 3.2.3, there's a public method for doing this:

wordnet.synset_from_pos_and_offset(pos, offset)

In earlier versions you can use:

wordnet._synset_from_pos_and_offset(pos, offset)

This returns a synset based on it's POS and offest ID. I think this method is only available in NLTK 3.0 but I'm not sure.

Example:

from nltk.corpus import wordnet as wn
wn.synset_from_pos_and_offset('n',4543158)
>> Synset('wagon.n.01')
like image 179
donners45 Avatar answered Jan 03 '23 16:01

donners45