Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

What's the difference between Stanford Tagger, Parser and CoreNLP?

I'm currently using different tools from Stanford NLP Group and trying to understand the differences between them. It seems to me that somehow they intersect each other, since I can use same features in different tools (e.g. tokenize, and POS-Tag a sentence can be done by Stanford POS-Tagger, Parser and CoreNLP).

I'd like to know what's the actual difference between each tool and in which situations I should use each of them.

like image 593
Felipe Leão Avatar asked Jun 02 '13 14:06

Felipe Leão


People also ask

What is Stanford Tagger?

A Part-Of-Speech Tagger (POS Tagger) is a piece of software that reads text in some language and assigns parts of speech to each word (and other token), such as noun, verb, adjective, etc., although generally computational applications use more fine-grained POS tags like 'noun-plural'.

What does Stanford parser do?

The parser can read various forms of plain text input and can output various analysis formats, including part-of-speech tagged text, phrase structure trees, and a grammatical relations (typed dependency) format.

Is Stanford coreNLP open source?

These software distributions are open source, licensed under the GNU General Public License (v3 or later for Stanford CoreNLP; v2 or later for the other releases).

What is coreNLP Python?

Stanford coreNLP is java based. This article is about its implementation in jupyter notebook (python). Stanford coreNLP can be used to extract multiple features that can be used to train any text-based machine learning model. The implementation can be a little tricky on big data.


1 Answers

All Java classes from the same release are the same, and, yes, they overlap. On a code basis, the parser and tagger are basically subsets of what is available in CoreNLP, except that they do have a couple of little add-ons of their own, such as the GUI for the parser. In terms of provided models, the parser and tagger come with models for a range of languages, whereas CoreNLP ships only with English out of the box. However, you can then download language-particular jars for CoreNLP which provide all the models we have for different languages. Anything that is available in any of the releases is present in the CoreNLP github site: https://github.com/stanfordnlp/CoreNLP

like image 189
Christopher Manning Avatar answered Nov 10 '22 10:11

Christopher Manning