How to read a constituency-based parse tree

I have a corpus of sentences that were preprocessed with Stanford CoreNLP. One of the things it provides is each sentence's constituency-based parse tree. I can understand a parse tree when it is drawn as a tree, but I'm not sure how to read it in this bracketed format:

E.g.:

          (ROOT
            (FRAG
              (NP (NN sent28))
              (: :)
              (S
                (NP (NNP Rome))
                (VP (VBZ is)
                  (PP (IN in)
                    (NP
                      (NP (NNP Lazio) (NN province))
                      (CC and)
                      (NP
                        (NP (NNP Naples))
                        (PP (IN in)
                          (NP (NNP Campania))))))))
              (. .)))

The original sentence is:

sent28: Rome is in Lazio province and Naples in Campania .

How am I supposed to read this tree? Alternatively, is there Python code that can read it properly? Thanks.

asked Feb 23 '15 13:02

Cheshie

1 Answer

NLTK has a class for reading parse trees: nltk.tree.Tree. The relevant method is fromstring, which builds a Tree from a bracketed string like the one above. You can then iterate over its subtrees, leaves, etc.
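A minimal sketch of that approach, assuming NLTK is installed and the bracketed parse is available as a plain string (the variable names here are only illustrative). Each "(LABEL child child ...)" group becomes one node, and the bare words are the leaves:

```python
from nltk.tree import Tree

# Bracketed parse from the question, as produced by CoreNLP.
parse_str = """
(ROOT
  (FRAG
    (NP (NN sent28))
    (: :)
    (S
      (NP (NNP Rome))
      (VP (VBZ is)
        (PP (IN in)
          (NP
            (NP (NNP Lazio) (NN province))
            (CC and)
            (NP
              (NP (NNP Naples))
              (PP (IN in)
                (NP (NNP Campania))))))))
    (. .)))
"""

# Build a Tree object from the bracketed string.
tree = Tree.fromstring(parse_str)

# Tokens in sentence order, and (word, POS tag) pairs.
words = tree.leaves()
tagged = tree.pos()

# Every NP node, flattened back to the words it covers.
noun_phrases = [
    " ".join(subtree.leaves())
    for subtree in tree.subtrees(lambda t: t.label() == "NP")
]

print(words)
print(tagged)
print(noun_phrases)
```

Calling tree.pretty_print() on the same object should draw the tree as ASCII art, which may be the easiest way to read it by eye.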

As an aside: you might want to remove the sent28: prefix before parsing, as it is not part of the sentence and it confuses the parser. Because of it, you are not getting a full parse of the sentence, just a sentence fragment (the FRAG node).
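One way to drop the prefix before the text reaches the parser is sketched below. It assumes every ID looks like "sent" followed by digits and a colon, as in the question; strip_sentence_id is a hypothetical helper name, not part of CoreNLP or NLTK:

```python
import re

def strip_sentence_id(line):
    """Remove a leading ID such as 'sent28:' (assumed pattern: 'sent' + digits + ':')."""
    return re.sub(r"^\s*sent\d+\s*:\s*", "", line)

raw = "sent28: Rome is in Lazio province and Naples in Campania ."
clean = strip_sentence_id(raw)
print(clean)
```

Lines without such a prefix pass through unchanged, so the helper can be applied to the whole corpus before re-running the parser.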

answered Oct 17 '22 13:10

mbatchkarov


Tags: python, parsing, nlp, parse-tree
