I understand the implicit value of part-of-speech tagging and have seen mentions about its use in parsing, text-to-speech conversion, etc. Could you tell me how is the output of a PoS tagger formated ? Also, could you explain how is such an output used by other tasks/parts of an NLP system?

Basically, the goal of a POS tagger is to assign linguistic (mostly grammatical) information to sub-sentential units. Such units are called tokens and, most of the time, correspond to words and symbols (e.g. punctuation). Considering the format of the output, it doesn't really matter as long as you get a sequence of token/tag pairs. Some POS taggers allow you to specify some specific output format, others use XML or CSV/TSV, and so on.

Uses/Applications of Part-of-speech-tagging (POS Tagging)

2 Answers

One purpose of PoS tagging is to disambiguate homonyms. For instance, take this sentence :

I fish a fish

The same sentence in french would be Je pêche un poisson. Without tagging, fish would be translated the same way in both case, which would lead to a wrong traduction. However, after PoS tagging, the sentence would be

I_PRON fish_VERB a_DET fish_NOUN

From a computer point of view, both words are now distinct. This wat, they can be processed much more efficiently (in our example, fish_VERB will be translated to pêche and fish_NOUN to poisson).

113

answered Oct 21 '22 13:10

merours

Basically, the goal of a POS tagger is to assign linguistic (mostly grammatical) information to sub-sentential units. Such units are called tokens and, most of the time, correspond to words and symbols (e.g. punctuation).

Considering the format of the output, it doesn't really matter as long as you get a sequence of token/tag pairs. Some POS taggers allow you to specify some specific output format, others use XML or CSV/TSV, and so on.

answered Oct 21 '22 13:10

Pierre

Related questions
                            
                                How to split an NLP parse tree to clauses (independent and subordinate)?
                            
                                Reduce Google's Word2Vec model with Gensim
                            
                                Generating dictionaries to categorize tweets into pre-defined categories using NLTK
                            
                                Simple spell checking algorithm
                            
                                Natural Language Processing for Smart Homes
                            
                                Using Stanford Parser(CoreNLP) to find phrase heads
                            
                                Bytes vs Characters vs Words - which granularity for n-grams?
                            
                                Testing the NLTK classifier on specific file
                            
                                NLTK - TypeError: tagged_words() got an unexpected keyword argument 'simplify_tags'
                            
                                What to do when Seq2Seq network repeats words over and over in output?
                            
                                Spacy NLP library: what is maximum reasonable document size
                            
                                removing stop words using spacy
                            
                                Difficulty in understanding the tokenizer used in Roberta model
                            
                                I have a list of country codes and a list of language codes. How do I map from country code to language code?
                            
                                Natural Language Parsing tools: what is out there and what is not? [closed]
                            
                                How to compute letter frequency similarity?
                            
                                how to choose parameters in TfidfVectorizer in sklearn during unsupervised clustering
                            
                                Dataframe as datasource in torchtext
                            
                                Is there a database, API, or parsable text for getting verb conjugations?
                            
                                NLP to find relationship between entities

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Uses/Applications of Part-of-speech-tagging (POS Tagging)

Tags:

nlp

part-of-speech

H W

People also ask

2 Answers

merours

Pierre

Recent Activity

Donate For Us