how to get parse tree using python nltk?

Tags:

nltk

Given the following sentence:

The old oak tree from India fell down.

How can I get the following parse tree representation of the sentence using python NLTK?

(ROOT (S (NP (NP (DT The) (JJ old) (NN oak) (NN tree)) (PP (IN from) (NP (NNP India)))) (VP (VBD fell) (PRT (RP down)))))

I need a complete example which I couldn't find in web!

Edit

I have gone through this book chapter to learn about parsing using NLTK but the problem is, I need a grammar to parse sentences or phrases which I do not have. I have found this stackoverflow post which also asked about grammar for parsing but there is no convincing answer there.

So, I am looking for a complete answer that can give me the parse tree given a sentence.

988

asked Feb 19 '17 02:02

Wasi Ahmad

2 Answers

Here is alternative solution using StanfordCoreNLP instead of nltk. There are few library that build on top of StanfordCoreNLP, I personally use pycorenlp to parse the sentence.

First you have to download stanford-corenlp-full folder where you have *.jar file inside. And run the server inside the folder (default port is 9000).

export CLASSPATH="`find . -name '*.jar'`"
java -mx4g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer [port?] # run server

Then in Python, you can run the following in order to tag the sentence.

from pycorenlp import StanfordCoreNLP
nlp = StanfordCoreNLP('http://localhost:9000')

text = "The old oak tree from India fell down."

output = nlp.annotate(text, properties={
  'annotators': 'parse',
  'outputFormat': 'json'
})

print(output['sentences'][0]['parse']) # tagged output sentence

168

answered Sep 24 '22 09:09

titipata

Older question, but you can use nltk together with the bllipparser. Here is a longer example from nltk. After some fiddling I myself used the following:

To install (with nltk already installed):

sudo python3 -m nltk.downloader bllip_wsj_no_aux
pip3 install bllipparser

To use:

from nltk.data import find
from bllipparser import RerankingParser

model_dir = find('models/bllip_wsj_no_aux').path
parser = RerankingParser.from_unified_model_dir(model_dir)

best = parser.parse("The old oak tree from India fell down.")

print(best.get_reranker_best())
print(best.get_parser_best())

Output:

-80.435259246021 -23.831876011253 (S1 (S (NP (NP (DT The) (JJ old) (NN oak) (NN tree)) (PP (IN from) (NP (NNP India)))) (VP (VBD fell) (PRT (RP down))) (. .)))
-79.703612178593 -24.505514522222 (S1 (S (NP (NP (DT The) (JJ old) (NN oak) (NN tree)) (PP (IN from) (NP (NNP India)))) (VP (VBD fell) (ADVP (RB down))) (. .)))

answered Sep 22 '22 09:09

vlz

Related questions
                            
                                Ubuntu , Apache2 , Django ) Fatal Python error: Py_Initialize: Unable to get the locale encoding ImportError: No module named 'encodings'
                            
                                Connecting Kafka-Python with a cluster with Kerberos
                            
                                Reverse a list without using built-in functions
                            
                                Why is my VotingClassifier accuracy less than my individual classifier?
                            
                                BeautifulSoup returns None even though the element exists
                            
                                Vectorizing NumPy covariance for 3D array
                            
                                Writing a ros node with both a publisher and subscriber?
                            
                                Cannot import sklearn.model_selection in scikit-learn
                            
                                pandas dataframe rolling window with groupby
                            
                                Memory usage steadily growing for multiprocessing.Pool.imap_unordered
                            
                                Python - Create Counter() from mapping, non-integer values
                            
                                Django-filter with DRF - How to do 'and' when applying multiple values with the same lookup?
                            
                                Modify OHLC resample code as per deprecated warning
                            
                                using matplotlib colormap with pandas dataframe.plot function
                            
                                Sqlalchemy - add columns to a query
                            
                                What does scipy.signal.convolve2d calculate? [duplicate]
                            
                                python garbage collection about list append itself [duplicate]
                            
                                Python namedtuple as argument to apply_async(..) callback
                            
                                timezone aware datetime objects in django templates
                            
                                What means the serialize=False on Primary-key field?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With