 

What is the default nltk part of speech tagset?

Tags: python, nlp, nltk

While experimenting with NLTK part-of-speech tagging, I noticed a lot of VBP tags in the output of my calls to nltk.pos_tag. This tag is not in the Brown Corpus part-of-speech tagset, but it is part of the UPenn tagset.

What tagset does NLTK use by default? I can't find this in the official documentation or the API docs.

DeaconDesperado asked Oct 16 '12 15:10



1 Answer

NLTK uses the Penn Treebank tagset by default. Have a look at this link: http://nltk.org/api/nltk.tag.html
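As a rough illustration of why VBP shows up, here is a minimal sketch. The dictionary is only an illustrative subset of the Penn Treebank verb tags, and `describe_tag` and `tag_tokens` are hypothetical helper names, not part of NLTK. The only NLTK call used is `nltk.pos_tag`, and it assumes NLTK is installed and its tagger model has already been downloaded.

```python
# Illustrative subset of the Penn Treebank verb tags (not the full tagset).
PENN_VERB_TAGS = {
    "VB": "verb, base form",
    "VBD": "verb, past tense",
    "VBG": "verb, gerund or present participle",
    "VBN": "verb, past participle",
    "VBP": "verb, non-3rd person singular present",
    "VBZ": "verb, 3rd person singular present",
}


def describe_tag(tag):
    """Hypothetical helper: look up a tag in the subset above."""
    return PENN_VERB_TAGS.get(tag, "not in this illustrative subset")


def tag_tokens(tokens):
    """Tag a list of tokens with NLTK's default tagger.

    Assumes nltk is installed and the tagger model has been downloaded;
    the name of that resource depends on the NLTK version.
    """
    import nltk  # imported lazily so the lookup above works without NLTK

    return nltk.pos_tag(tokens)
```

Calling `tag_tokens("I like tea".split())` should return a list of `(token, tag)` pairs whose tags come from the Penn Treebank set, and `describe_tag("VBP")` then explains the tag from the question. The exact tags assigned depend on the installed model.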

Chandan Gupta answered Nov 15 '22 00:11
