pos_tag in NLTK does not tag sentences correctly

Tags:

nltk

I have used this code:

# Step 1 : TOKENIZE
from nltk.tokenize import *
words = word_tokenize(text)

# Step 2 : POS DISAMBIG
from nltk.tag import *
tags = pos_tag(words)

to tag two sentences: John is very nice. Is John very nice?

John in the first sentence was NN while in the second was VB! So, how can we correct pos_tag function without training back-off taggers?

Modified question:

I have seen the demonstration of NLTK taggers here http://text-processing.com/demo/tag/. When I tried the option "English Taggers & Chunckers: Treebank" or "Brown Tagger", I get the correct tags. So how to use Brown Tagger for example without training it?

485

asked Dec 03 '11 04:12

user842457

1 Answers

Short answer: you can't. Slightly longer answer: you can override specific words using a manually created UnigramTagger. See my answer for custom tagging with nltk for details on this method.

answered Sep 22 '22 02:09

Jacob

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

pos_tag in NLTK does not tag sentences correctly

Tags:

nltk

user842457

People also ask

1 Answers

Jacob

Recent Activity

Donate For Us

pos_tag in NLTK does not tag sentences correctly

Tags:

nltk

user842457

People also ask

1 Answers

Jacob

Related questions

Recent Activity

Donate For Us