To find synonyms, definitions and example sentences using WordNet

Tags: python, nltk, wordnet

I need to read an input text file containing a single word, then find the lemma names, definition and examples for each synset of that word using WordNet. I have gone through the books "Python Text Processing with NLTK 2.0 Cookbook" and "Natural Language Processing using NLTK" to help me in this direction. Though I understand how to do this in the terminal, I'm not able to do the same using a text editor.

For example, if the input text has the word "flabbergasted", the output needs to be in this fashion:

flabbergasted
(verb) flabbergast, boggle, bowl over - overcome with amazement; "This boggles the mind!"
(adjective) dumbfounded, dumfounded, flabbergasted, stupefied, thunderstruck, dumbstruck, dumbstricken - as if struck dumb with astonishment and surprise; "a circle of policement stood dumbfounded by her denial of having seen the accident"; "the flabbergasted aldermen were speechless"; "was thunderstruck by the news of his promotion"

The synsets, definitions and example sentences are obtained from WordNet directly!

I have the following piece of code:


from __future__ import division
import nltk
from nltk.corpus import wordnet as wn

tokenizer = nltk.data.load('tokenizers/punkt/english.pickle')
with open("inpsyn.txt") as fp:
    data = fp.read()

# Split the input text into sentences
print '\n-----\n'.join(tokenizer.tokenize(data))

# Split the text into lowercase word tokens
tokens = nltk.wordpunct_tokenize(data)
text = nltk.Text(tokens)
words = [w.lower() for w in text]
print words

# Look up the synsets of each word (NLTK 2.x attribute-style API)
for a in words:
    print a
    syns = wn.synsets(a)
    print "synsets:", syns

    for s in syns:
        for l in s.lemmas:
            print l.name
        print s.definition
        print s.examples

I get the following output:


flabbergasted

['flabbergasted']
flabbergasted
synsets: [Synset('flabbergast.v.01'), Synset('dumbfounded.s.01')]
flabbergast
boggle
bowl_over
overcome with amazement
['This boggles the mind!']
dumbfounded
dumfounded
flabbergasted
stupefied
thunderstruck
dumbstruck
dumbstricken
as if struck dumb with astonishment and surprise
['a circle of policement stood dumbfounded by her denial of having seen the accident', 'the flabbergasted aldermen were speechless', 'was thunderstruck by the news of his promotion']

Is there a way to retrieve the part of speech along with the group of lemma names?

asked Apr 04 '11 05:04

aks

1 Answer

def synset(word):
    wn.synsets(word)

This function doesn't return anything, so calling it gives None by default.

You should write:

def synset(word):
    return wn.synsets(word)

Extracting lemma names (NLTK 2.x attribute syntax; in NLTK 3.x these are methods, so write s.lemmas() and l.name() instead):

>>> from nltk.corpus import wordnet
>>> syns = wordnet.synsets('car')
>>> syns[0].lemmas[0].name
'car'
>>> [s.lemmas[0].name for s in syns]
['car', 'car', 'car', 'car', 'cable_car']
>>> [l.name for s in syns for l in s.lemmas]
['car', 'auto', 'automobile', 'machine', 'motorcar', 'car', 'railcar', 'railway_car', 'railroad_car', 'car', 'gondola', 'car', 'elevator_car', 'cable_car', 'car']
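
As for the part of speech the question asks about: each synset carries a one-letter POS code ('n', 'v', 'a', 's' or 'r'), available as s.pos() in NLTK 3.x (the attribute s.pos in NLTK 2.x). Below is a minimal sketch, assuming Python 3 and NLTK 3.x with the WordNet corpus downloaded. The names POS_NAMES, format_entry and describe are hypothetical helpers introduced for illustration, and reporting satellite adjectives ('s') as plain "adjective" is a choice made here to match the output requested in the question.

```python
# Readable names for WordNet's one-letter part-of-speech codes.
# 's' (satellite adjective) is reported as "adjective" by choice.
POS_NAMES = {
    'n': 'noun',
    'v': 'verb',
    'a': 'adjective',
    's': 'adjective',
    'r': 'adverb',
}


def format_entry(pos, lemma_names, definition, examples):
    """Build one '(pos) lemmas - definition; "example"' line (hypothetical helper)."""
    # WordNet joins multi-word lemmas with underscores, e.g. 'bowl_over'.
    names = ', '.join(name.replace('_', ' ') for name in lemma_names)
    line = '(%s) %s - %s' % (POS_NAMES.get(pos, pos), names, definition)
    quoted = '; '.join('"%s"' % ex for ex in examples)
    return line + '; ' + quoted if quoted else line


def describe(word):
    """Return one formatted line per synset of `word` (assumes NLTK 3.x)."""
    # Imported lazily so format_entry works without NLTK installed.
    from nltk.corpus import wordnet as wn
    return [
        format_entry(s.pos(), s.lemma_names(), s.definition(), s.examples())
        for s in wn.synsets(word)
    ]


# Values copied from the question's output for Synset('flabbergast.v.01').
line = format_entry('v', ['flabbergast', 'boggle', 'bowl_over'],
                    'overcome with amazement', ['This boggles the mind!'])
print(line)
# (verb) flabbergast, boggle, bowl over - overcome with amazement; "This boggles the mind!"
```

With NLTK and its WordNet data installed, printing each line of describe('flabbergasted') should give one such line per synset.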

answered Sep 19 '22 18:09

Andrey Sboev
