Determining tense of a sentence Python

Following several other posts (e.g. Detect English verb tenses using NLTK, Identifying verb tenses in python, Python NLTK figure out tense), I wrote the following code to determine the tense of a sentence in Python using POS tagging:

from nltk import word_tokenize, pos_tag

def determine_tense_input(sentence):
    # Tokenize and POS-tag the sentence (Penn Treebank tags).
    text = word_tokenize(sentence)
    tagged = pos_tag(text)

    # Count the tags associated with each tense.
    tense = {}
    tense["future"] = len([word for word in tagged if word[1] == "MD"])
    tense["present"] = len([word for word in tagged if word[1] in ["VBP", "VBZ", "VBG"]])
    tense["past"] = len([word for word in tagged if word[1] in ["VBD", "VBN"]])
    return tense

This returns a count of past, present, and future verb tags; I then typically take the tense with the highest count as the tense of the sentence. The accuracy is moderately decent, but I am wondering if there is a better way of doing this.
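The "take the max" step could look like the sketch below. The helper name `pick_tense`, and the choice to return `None` when no verb tags were counted or when two tenses tie, are assumptions for illustration and not part of the original code.

```python
def pick_tense(counts):
    """Return the tense with the highest count.

    `counts` is a dict such as {"future": 0, "present": 2, "past": 1}.
    Returns None if nothing was counted or if the top count is tied
    (an assumed policy; a real application might prefer another rule).
    """
    if not counts:
        return None
    best = max(counts.values())
    if best == 0:
        return None
    winners = [tense for tense, n in counts.items() if n == best]
    return winners[0] if len(winners) == 1 else None
```

Returning `None` on a tie keeps ambiguous sentences visible instead of silently choosing whichever key comes first in the dict.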

For example, is there by now a package dedicated to extracting the tense of a sentence? (Note: two of the three Stack Overflow posts are four years old, so things may have changed.) Alternatively, should I be using a different parser within NLTK to increase accuracy? If not, I hope the above code helps someone else!

asked May 03 '15 17:05

kyrenia

3 Answers

You can strengthen your approach in various ways. You could think more about the grammar of English and add some more rules based on whatever you observe; or you could push the statistical approach, extract some more (relevant) features and throw the whole lot at a classifier. The NLTK gives you plenty of classifiers to play with, and they're well documented in the NLTK book.

You can have the best of both worlds: hand-written rules can take the form of features that are fed to the classifier, which will decide when it can rely on them.
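As a rough sketch of what "rules as features" might look like, the function below turns a POS-tagged sentence into a feature dict. The function name `tense_features` and the particular features chosen are hypothetical; they only illustrate the idea and are not a tested feature set.

```python
def tense_features(tagged):
    """Build a feature dict from a list of (word, tag) pairs.

    The input is assumed to use Penn Treebank tags, as produced by
    nltk.pos_tag. Every feature here is an illustrative guess.
    """
    words = [word.lower() for word, _ in tagged]
    tags = [tag for _, tag in tagged]
    verb_tags = [t for t in tags if t.startswith("VB") or t == "MD"]
    return {
        # Raw tag counts, as in the question's approach.
        "count_MD": tags.count("MD"),
        "count_VBD": tags.count("VBD"),
        "count_VBN": tags.count("VBN"),
        "count_VBP_VBZ": tags.count("VBP") + tags.count("VBZ"),
        "count_VBG": tags.count("VBG"),
        # Hand-written rule: not every modal signals the future.
        "has_will_or_shall": any(w in ("will", "shall", "'ll") for w in words),
        # Hand-written rule: the first verb-like tag often carries the tense.
        "first_verb_tag": verb_tags[0] if verb_tags else "NONE",
    }
```

Feature dicts like this, each paired with a tense label, are the `(featureset, label)` format that NLTK classifiers such as `nltk.NaiveBayesClassifier.train` accept as training data.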

answered Sep 20 '22 15:09

alexis

You could use the Stanford Parser to get a dependency parse of the sentence. The root of the dependency parse will be the 'primary' verb that defines the sentence (I'm not too sure what the specific linguistic term is). You can then use the POS tag on this verb to find its tense, and use that.
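A minimal sketch of the last step, assuming a dependency parser has already told you the position of the root verb and of any auxiliaries attached to it. The function does not call the Stanford Parser itself; the name `tense_from_root`, its parameters, and the tag-to-tense mapping are all illustrative assumptions.

```python
# Coarse mapping from finite Penn Treebank verb tags to a tense label
# (an assumption for illustration, not a linguistic standard).
TAG_TO_TENSE = {"VBD": "past", "VBP": "present", "VBZ": "present"}

def tense_from_root(tagged, root_index, aux_indices=()):
    """Guess the tense from the root verb and its auxiliaries.

    tagged       list of (word, tag) pairs for the sentence
    root_index   position of the parse root (supplied by the parser)
    aux_indices  positions of auxiliaries attached to the root
    Returns "past", "present", "future", or None if undecided.
    """
    # A finite auxiliary carries the tense: "will go", "was going".
    for i in sorted(aux_indices):
        word, tag = tagged[i]
        if tag == "MD" and word.lower() in ("will", "shall", "'ll"):
            return "future"
        if tag in TAG_TO_TENSE:
            return TAG_TO_TENSE[tag]
    # Otherwise fall back to the tag on the root verb itself.
    return TAG_TO_TENSE.get(tagged[root_index][1])
```

Checking the auxiliaries first matters because in a sentence like "I will go" the root "go" is tagged VB, which says nothing about tense on its own.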

answered Sep 23 '22 15:09

viswajithiii

According to http://dev.lexalytics.com/wiki/pmwiki.php?n=Main.POSTags, the tags mean

MD  Modal verb (can, could, may, must)
VB  Base verb (take)
VBC Future tense, conditional
VBD Past tense (took)
VBF Future tense
VBG Gerund, present participle (taking)
VBN Past participle (taken)
VBP Present tense (take)
VBZ Present 3rd person singular (takes)

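so that, with a tagger that emits these tags (the Penn Treebank tagset used by NLTK's default pos_tag has no VBC or VBF), your code would be

tense["future"] = len([word for word in tagged if word[1] in ["VBC", "VBF"]])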

answered Sep 21 '22 15:09

serv-inc

Tags:

python

nlp

nltk