NLTK Chunking and walking the results tree

Tags: python, text-parsing, nltk, chunking

I'm using NLTK's RegexpParser to extract noun groups and verb groups from tagged tokens.

How do I walk the resulting tree to find only the chunks that are NP or V groups?

from nltk.chunk import RegexpParser

grammar = '''
NP: {<DT>?<JJ>*<NN>*}
V: {<V.*>}'''
chunker = RegexpParser(grammar)
tokens = []  # Some (word, tag) tokens from my POS tagger
chunked = chunker.parse(tokens)
print(chunked)

# How do I walk the tree?
# for chunk in chunked:
#     if chunk.??? == 'NP':
#         print(chunk)

(S (NP Carrier/NN) for/IN tissue-/JJ and/CC cell-culture/JJ for/IN (NP the/DT preparation/NN) of/IN (NP implants/NNS) and/CC (NP implant/NN) (V containing/VBG) (NP the/DT carrier/NN) ./.)

asked Oct 01 '11 08:10

Vincent Theeten

1 Answer

This should work:

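import nltk

for n in chunked:
    if isinstance(n, nltk.tree.Tree):
        # n is a chunk (subtree); label() is the NLTK 3 accessor
        if n.label() in ('NP', 'V'):
            do_something_with_subtree(n)
    else:
        # n is an unchunked (word, tag) leaf
        do_something_with_leaf(n)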
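
If you only care about the chunks and not the loose leaves, Tree.subtrees() with a filter may be a shorter route. Below is a minimal sketch, not a definitive implementation. The sample tokens are copied from part of the output in the question, chunks_with_labels is a hypothetical helper name, and the grammar is an assumed variant that uses <NN.*>+ instead of <NN>* so that plural nouns (NNS) are chunked and an NP always contains at least one noun.

```python
from nltk.chunk import RegexpParser

# Hypothetical tagged tokens, taken from part of the sample output in the question.
tokens = [("the", "DT"), ("preparation", "NN"), ("of", "IN"),
          ("implants", "NNS"), ("containing", "VBG"),
          ("the", "DT"), ("carrier", "NN")]

# Assumed variant of the question's grammar: <NN.*>+ also matches NNS
# and requires at least one noun per NP chunk.
grammar = r'''
NP: {<DT>?<JJ>*<NN.*>+}
V: {<V.*>}
'''
chunked = RegexpParser(grammar).parse(tokens)


def chunks_with_labels(tree, labels):
    """Return (label, phrase) pairs for every subtree whose label is in labels."""
    found = []
    # subtrees() visits the root and every nested chunk; the filter keeps
    # only the chunks we asked for (the root label 'S' is skipped).
    for subtree in tree.subtrees(filter=lambda t: t.label() in labels):
        words = [word for word, tag in subtree.leaves()]
        found.append((subtree.label(), " ".join(words)))
    return found


result = chunks_with_labels(chunked, {"NP", "V"})
print(result)
```

Unlike the plain for loop, which only looks at the top level of the tree, subtrees() also reaches nested chunks, which can matter if the grammar has several stages that build on each other.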

answered Sep 21 '22 08:09

Savino Sguera
