Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Wordnet (Word Sense Annotated) Corpus

I've been utilizing lots of different corpora for natural language processing, and I've been looking for a corpus that has been annotated with Wordnet Word Senses.

I understand that there probably is not a big corpus with this information, since the corpus needs to be built up manually, but there has to be something to go off of.

Also if there isn't a corpus in existence, is there at least a sense annotated ngram database (with what percentage of the time a word is each of its definitions, or a numerical count of each wordnet definition depending on how common the word sense is)?

like image 538
cardine Avatar asked Jan 18 '23 19:01

cardine


2 Answers

Three prominent corpora annotated for WordNet:

  • MASC
  • WordNet gloss
  • SemCor
like image 146
cyborg Avatar answered Jan 20 '23 09:01

cyborg


Some of the SENSEVAL (now SEMEVAL) data is annotated with WordNet.

like image 42
Fabian Steeg Avatar answered Jan 20 '23 09:01

Fabian Steeg