I have a word, and I want to find out whether a given text is related to that word, using Python and NLTK. Is that possible?
For example, take the word "phosphorous". I would like to find out whether a particular text file is related to this word or not.
I can't use a bag-of-words model in NLTK, as I have only one word and no training data.
Any suggestions?
Thanks in advance.
Not without a corpus, no.
Look at it this way: can you, an intelligent being, tell whether 光 is related to 部屋に入った時電気をつけました without asking someone or something that actually knows Japanese (assuming you don't know Japanese; if you do, try with "svjetlo" and "Kad je ušao u sobu, upalio je lampu")? If you can't, how do you expect a computer to do it?
And another experiment: can you, an intelligent being, give me the algorithm by which you could teach a non-English-speaking person that "light" is related to "When he entered the room, he turned on the lamp"? Again, no.
tl;dr: You need training data, unless you significantly restrict the meaning of "related" (to "contains", for example).
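For example, here is a minimal sketch of that restricted version, where "related" just means "the document contains the word or an inflected form of it". The file name document.txt and the contains_word helper are made up for illustration, and the standard NLTK data packages (punkt, wordnet) need to be downloaded first:

import nltk
from nltk.stem import WordNetLemmatizer

def contains_word(path, target):
    """Return True if the document contains `target` (after lemmatization)."""
    lemmatizer = WordNetLemmatizer()
    target = lemmatizer.lemmatize(target.lower())
    with open(path, encoding='utf-8') as f:
        tokens = nltk.word_tokenize(f.read().lower())
    # Lemmatize tokens so that, e.g., "lamps" matches "lamp".
    return any(lemmatizer.lemmatize(tok) == target for tok in tokens)

print(contains_word('document.txt', 'phosphorous'))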
You can use NLTK's WordNet interface to calculate a path-similarity score between your word and the words in the text, and build a heuristic based on that score:
from nltk.corpus import wordnet as wn

# Look up one sense (synset) of each word, then compare them.
hit = wn.synset('hit.v.01')
slap = wn.synset('slap.v.01')

# Path similarity ranges from 0 to 1; higher means more closely related.
wn.path_similarity(hit, slap)
You can find more NLTK WordNet usage examples here: http://www.nltk.org/howto/wordnet.html
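To tie this back to the original question, here is a rough sketch of one way such a heuristic could look. The relatedness function, the document.txt file name, and the 0.5 cutoff are my own placeholders, not anything built into NLTK; it simply takes the best path-similarity match between any token in the file and any sense of the target word:

import nltk
from nltk.corpus import wordnet as wn

def relatedness(path, target_word):
    """Best WordNet path similarity between any token in the file and the target word."""
    target_synsets = wn.synsets(target_word)
    with open(path, encoding='utf-8') as f:
        tokens = set(nltk.word_tokenize(f.read().lower()))
    best = 0.0
    for tok in tokens:
        for s1 in wn.synsets(tok):
            for s2 in target_synsets:
                sim = s1.path_similarity(s2)
                if sim is not None and sim > best:
                    best = sim
    return best  # 1.0 means an exact or synonymous match was found

# Path similarity is defined over the noun/verb hierarchies, so the noun
# "phosphorus" tends to work better here than the adjective "phosphorous".
if relatedness('document.txt', 'phosphorus') > 0.5:
    print('The text looks related to the target word.')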