In Natural language processing, what is the purpose of chunking?
The term comes from cognitive psychology: by grouping individual data points into a larger whole, you can increase the amount of information you can remember. Probably the most familiar everyday example is phone numbers, where a digit sequence like 4-7-1-1-3-2-4 is chunked into 471-1324. Chunking in NLP borrows the same idea, grouping adjacent tokens into larger units.
In NLTK, chunking is done with the help of regular expressions over part-of-speech tags. The key operator is `*`, which means the preceding element can occur zero or more times, so it may or may not be present at all. For example, `ab*` matches an `a` followed by any number of `b`'s: it matches `a`, `ab`, `abb`, `abbb`, and so on.
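The behavior of `*` is easy to check with Python's built-in `re` module (the test strings here are just for illustration):

```python
import re

# "a" followed by zero or more "b"s; the "b" may be absent entirely.
pattern = re.compile(r"ab*")

for s in ["a", "ab", "abbb", "abc"]:
    m = pattern.match(s)
    print(s, "->", m.group() if m else "no match")
# "abc" still matches, but only its "ab" prefix is consumed.
```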
Chunking is the process of grouping related words together based on their parts of speech. To do this, you define a grammar that specifies which sequences of tags (determiners, adjectives, nouns, and so on) should be folded into a chunk.
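NLTK's `RegexpParser` is the usual tool for this, but the mechanics can be sketched in plain Python with the standard `re` module and no NLTK dependency. The grammar below, an optional determiner, any number of adjectives, then a noun (the NLTK-style rule would be `NP: {<DT>?<JJ>*<NN>}`), and the tagged sentence are assumptions for illustration:

```python
import re

def chunk_np(tagged):
    """Group (word, tag) pairs into noun-phrase chunks matching DT? JJ* NN."""
    # Encode the tag sequence as one string, one "<TAG>" per token.
    tag_string = "".join("<%s>" % tag for _, tag in tagged)
    chunks = []
    for m in re.finditer(r"(?:<DT>)?(?:<JJ>)*<NN>", tag_string):
        # Each token contributes exactly one "<", so counting "<" maps
        # character offsets back to token indices.
        start = tag_string[:m.start()].count("<")
        end = tag_string[:m.end()].count("<")
        chunks.append([w for w, _ in tagged[start:end]])
    return chunks

tagged = [("the", "DT"), ("little", "JJ"), ("yellow", "JJ"), ("dog", "NN"),
          ("barked", "VBD"), ("at", "IN"), ("the", "DT"), ("cat", "NN")]
print(chunk_np(tagged))
# [['the', 'little', 'yellow', 'dog'], ['the', 'cat']]
```

With NLTK installed, the equivalent would be `nltk.RegexpParser("NP: {<DT>?<JJ>*<NN>}").parse(tagged)`, which returns a tree rather than flat lists.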
Chunking is used to categorize different tokens into the same chunk, and the result depends on the grammar you select. In NLTK, chunking is also used to tag patterns and to explore text corpora.
Chunking is also called shallow parsing: it identifies parts of speech and short phrases (like noun phrases). Part-of-speech tagging tells you whether words are nouns, verbs, adjectives, etc., but it doesn't give you any clue about the structure of the sentence or of the phrases in it. Sometimes it's useful to have more information than just the parts of speech of words, but you don't need the full parse tree that you would get from a complete parse.
An example of a task where chunking might be preferable to full parsing is named entity recognition (NER). In NER, your goal is to find named entities, which tend to be noun phrases (though they aren't always), so you would want to know that President Barack Obama appears in the following sentence:
President Barack Obama criticized insurance companies and banks as he urged supporters to pressure Congress to back his moves to revamp the health-care system and overhaul financial regulations. (source)
But you wouldn't necessarily care that he is the subject of the sentence.
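Under that view, NER-as-chunking can be sketched with a grammar that groups runs of consecutive proper nouns (`NNP` tags). The tagged input below is hand-written for illustration, not the output of a real tagger:

```python
import re

def proper_noun_chunks(tagged):
    """Return runs of consecutive NNP-tagged words as candidate named entities."""
    tag_string = "".join("<%s>" % tag for _, tag in tagged)
    entities = []
    for m in re.finditer(r"(?:<NNP>)+", tag_string):
        # Count "<" to map character offsets back to token indices.
        start = tag_string[:m.start()].count("<")
        end = tag_string[:m.end()].count("<")
        entities.append(" ".join(w for w, _ in tagged[start:end]))
    return entities

tagged = [("President", "NNP"), ("Barack", "NNP"), ("Obama", "NNP"),
          ("criticized", "VBD"), ("insurance", "NN"), ("companies", "NNS"),
          ("and", "CC"), ("banks", "NNS")]
print(proper_noun_chunks(tagged))
# ['President Barack Obama']
```

A real NER system would go further (classifying entity types, handling entities that are not proper-noun runs), but the chunking step gives you the candidate phrases without ever building a full parse tree.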
Chunking has also been fairly commonly used as a preprocessing step for other tasks like example-based machine translation, natural language understanding, speech generation, and others.
For "text chunking" in natural language processing, see the lecture linked in the original answer (you probably want all the lectures in that series as a kind of "NLP 101"). Chunking spans a series of tasks such as finding noun groups, finding verb groups, and completely partitioning a sentence into chunks of several types; the lecture goes into more detail.