Approach for identifying whether a sentence includes an imperative within it

Tags:

stanford-nlp

Looking to find out whether a sentence includes an imperative within it (e.g. categorize "click below" as an imperative, whereas "here is some information" as not).

Is this possible with e.g. the Stanford Parser? For reference, the main site (http://nlp.stanford.edu/software/lex-parser.shtml) indicates 'Improved recognition of imperatives', however the dependency manual does not indicate a filed for them http://nlp.stanford.edu/software/dependencies_manual.pdf )

Alternatively, is there another approach which would work?

552

asked Apr 06 '15 14:04

kyrenia

1 Answers

I also failed to find any library or literature that (directly) addresses 'imperative detection' (there must be a different official name for it...). Here's what I've come up with by reading up on the grammar of imperatives, learning about chunking and some experimentation.

(Python + NLTK)

from nltk import RegexpParser
from nltk.tree import Tree

def is_imperative(tagged_sent):
    # if the sentence is not a question...
    if tagged_sent[-1][0] != "?":
        # catches simple imperatives, e.g. "Open the pod bay doors, HAL!"
        if tagged_sent[0][1] == "VB" or tagged_sent[0][1] == "MD":
            return True

        # catches imperative sentences starting with words like 'please', 'you',...
        # E.g. "Dave, stop.", "Just take a stress pill and think things over."
        else:
            chunk = get_chunks(tagged_sent)
            # check if the first chunk of the sentence is a VB-Phrase
            if type(chunk[0]) is Tree and chunk[0].label() == "VB-Phrase":
                return True

    # Questions can be imperatives too, let's check if this one is
    else:
        # check if sentence contains the word 'please'
        pls = len([w for w in tagged_sent if w[0].lower() == "please"]) > 0
        # catches requests disguised as questions
        # e.g. "Open the doors, HAL, please?"
        if pls and (tagged_sent[0][1] == "VB" or tagged_sent[0][1] == "MD"):
            return True

        chunk = get_chunks(tagged_sent)
        # catches imperatives ending with a Question tag
        # and starting with a verb in base form, e.g. "Stop it, will you?"
        elif type(chunk[-1]) is Tree and chunk[-1].label() == "Q-Tag":
            if (chunk[0][1] == "VB" or
                (type(chunk[0]) is Tree and chunk[0].label() == "VB-Phrase")):
                return True

    return False

# chunks the sentence into grammatical phrases based on its POS-tags
def get_chunks(tagged_sent):
    chunkgram = r"""VB-Phrase: {<DT><,>*<VB>}
                    VB-Phrase: {<RB><VB>}
                    VB-Phrase: {<UH><,>*<VB>}
                    VB-Phrase: {<UH><,><VBP>}
                    VB-Phrase: {<PRP><VB>}
                    VB-Phrase: {<NN.?>+<,>*<VB>}
                    Q-Tag: {<,><MD><RB>*<PRP><.>*}"""
    chunkparser = RegexpParser(chunkgram)
    return chunkparser.parse(tagged_sent)

Haven't tested the performance of the algorithm yet, though from my observations I'd say precision is probably better than recall. Note that the performance greatly depends on the correctness of the POS-tags.

127

answered Mar 03 '23 11:03

nischi

Related questions
                            
                                Natural language command language
                            
                                How to recognize words in text with non-word tokens?
                            
                                Korean, Thai and Indonesian POS tagger
                            
                                Wordnet selectional restrictions in NLTK
                            
                                Natural Language to Sparql
                            
                                TF-IDF Simple Use - NLTK/Scikit Learn
                            
                                Python: using scikit-learn to predict, gives blank predictions
                            
                                Entity Recognition and Sentiment Analysis using NLP
                            
                                Get noun from verb Wordnet
                            
                                Choosing appropriate sense of a word from wordnet
                            
                                SyntaxNet creating tree to root verb
                            
                                Collocations with spaCy
                            
                                Named entity recognition (NER) features
                            
                                Recurrent NNs: what's the point of parameter sharing? Doesn't padding do the trick anyway?
                            
                                Why use cosine similarity in Word2Vec when its trained using dot-product similarity
                            
                                How do I train gpt 2 from scratch?
                            
                                Decoding Permutated English Strings
                            
                                Using my own corpus for category classification in Python NLTK
                            
                                Parsing words into (prefix, root, suffix) in Python
                            
                                Conduit: Multiple Stream Consumers

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With