 

How to train a naive bayes classifier with pos-tag sequence as a feature?

I have two classes of sentences. Each has a reasonably distinct POS-tag sequence. How can I train a Naive Bayes classifier with the POS-tag sequence as a feature? Does Stanford CoreNLP or NLTK (Java or Python) provide any method for building a classifier with POS tags as features? I know that in Python NLTK's NaiveBayesClassifier allows building an NB classifier, but it uses contains-a-word features. Can it be extended to use the POS-tag sequence as a feature?

kundan asked Feb 27 '15 11:02


People also ask

How do I train Naive Bayes classifier?

Step 1: Calculate the prior probability for the given class labels. Step 2: Find the likelihood probability of each attribute for each class. Step 3: Put these values into Bayes' formula and calculate the posterior probability. Step 4: See which class has the higher probability; the input belongs to that class.
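The four steps above can be sketched in plain Python. The toy dataset, feature names, and values below are invented purely for illustration:

```python
from collections import Counter, defaultdict

# Toy dataset: each sample is (features, label). All names and values
# here are made up for illustration.
data = [
    ({'outlook': 'sunny', 'windy': 'no'}, 'play'),
    ({'outlook': 'sunny', 'windy': 'yes'}, 'no_play'),
    ({'outlook': 'rainy', 'windy': 'yes'}, 'no_play'),
    ({'outlook': 'sunny', 'windy': 'no'}, 'play'),
]

# Step 1: prior probability for each class label.
label_counts = Counter(label for _, label in data)
priors = {label: count / len(data) for label, count in label_counts.items()}

# Step 2: likelihood of each attribute value given each class.
value_counts = defaultdict(Counter)  # (label, feature) -> Counter of values
for features, label in data:
    for feat, value in features.items():
        value_counts[(label, feat)][value] += 1

def likelihood(label, feat, value):
    counts = value_counts[(label, feat)]
    return counts[value] / sum(counts.values()) if counts else 0.0

# Steps 3 and 4: posterior score (up to a normalizing constant)
# for each class, then pick the class with the higher score.
def classify(features):
    scores = {}
    for label in priors:
        score = priors[label]
        for feat, value in features.items():
            score *= likelihood(label, feat, value)
        scores[label] = score
    return max(scores, key=scores.get)

print(classify({'outlook': 'sunny', 'windy': 'no'}))  # -> 'play'
```

Note this sketch uses unsmoothed counts, so an attribute value never seen with a class zeroes out that class; real implementations add smoothing.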

What does the Naive Bayes classifier assume the features are?

It is a classification technique based on Bayes' Theorem with an assumption of independence among predictors. In simple terms, a Naive Bayes classifier assumes that the presence of a particular feature in a class is unrelated to the presence of any other feature.

How HMM is used for POS tagging explain in detail?

Use of HMM for POS Tagging The POS tagging process is the process of finding the sequence of tags which is most likely to have generated a given word sequence. We can model this POS process by using a Hidden Markov Model (HMM), where tags are the hidden states that produced the observable output, i.e., the words.
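Decoding the most likely hidden tag sequence from an HMM is typically done with the Viterbi algorithm. Here is a minimal sketch with a hand-set tagset, transition, and emission probabilities (all numbers are invented for illustration, not estimated from a corpus):

```python
# Tiny Viterbi decoder over hand-set HMM parameters.
tagset = ['DT', 'NN', 'VB']
start = {'DT': 0.6, 'NN': 0.3, 'VB': 0.1}          # P(first tag)
trans = {                                           # P(next tag | tag)
    'DT': {'DT': 0.05, 'NN': 0.9, 'VB': 0.05},
    'NN': {'DT': 0.1, 'NN': 0.3, 'VB': 0.6},
    'VB': {'DT': 0.5, 'NN': 0.3, 'VB': 0.2},
}
emit = {                                            # P(word | tag)
    'DT': {'the': 0.9},
    'NN': {'dog': 0.8, 'barks': 0.1},
    'VB': {'dog': 0.1, 'barks': 0.8},
}

def viterbi(words):
    # V[i][t]: probability of the best tag path ending in tag t at word i.
    V = [{t: start[t] * emit[t].get(words[0], 0.0) for t in tagset}]
    back = []  # back[i][t]: previous tag on that best path
    for word in words[1:]:
        col, ptr = {}, {}
        for t in tagset:
            best_prev = max(tagset, key=lambda p: V[-1][p] * trans[p][t])
            col[t] = V[-1][best_prev] * trans[best_prev][t] * emit[t].get(word, 0.0)
            ptr[t] = best_prev
        V.append(col)
        back.append(ptr)
    # Trace the best path backwards from the most probable final tag.
    last = max(tagset, key=lambda t: V[-1][t])
    path = [last]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    return list(reversed(path))

print(viterbi(['the', 'dog', 'barks']))  # -> ['DT', 'NN', 'VB']
```

In practice the probabilities are estimated from a tagged corpus (e.g. with NLTK's HiddenMarkovModelTrainer) rather than set by hand.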

What is CRF for POS tagging?

A CRF is a sequence modeling algorithm which is used to identify entities or patterns in text, such as POS tags. This model not only assumes that features are dependent on each other, but also considers future observations while learning a pattern.


1 Answer

If you know how to train and classify texts (or sentences, in your case) using NLTK's naive Bayes classifier with words as features, then you can easily extend this approach to classify texts by POS tags, because the classifier doesn't care whether your feature strings are words or tags. So you can simply replace the words of your sentences with their POS tags, using for example NLTK's standard POS tagger:

import nltk

sent = ['So', 'they', 'have', 'internet', 'on', 'computers', 'now']
tags = [t for w, t in nltk.pos_tag(sent)]
print(tags)

['IN', 'PRP', 'VBP', 'JJ', 'IN', 'NNS', 'RB']

From there you can proceed exactly as in the "contains-a-word" approach, only with tags in place of words.
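A minimal sketch of that idea: feed "contains-a-tag" features (plus tag bigrams, so some of the sequence information survives) into nltk.NaiveBayesClassifier. The two class labels and the tag sequences below are invented stand-ins for the output of nltk.pos_tag on your own sentences:

```python
import nltk

def tag_features(tags):
    # "contains-a-tag" features, plus adjacent-tag bigrams so part of
    # the sequence information is kept.
    feats = {'contains({})'.format(t): True for t in tags}
    feats.update({'bigram({}+{})'.format(a, b): True
                  for a, b in zip(tags, tags[1:])})
    return feats

# Hypothetical training data: POS-tag sequences with class labels.
train = [
    (['PRP', 'VBP', 'NN'], 'statement'),
    (['PRP', 'VBD', 'DT', 'NN'], 'statement'),
    (['MD', 'PRP', 'VB', 'NN'], 'question'),
    (['WP', 'VBZ', 'DT', 'NN'], 'question'),
]

classifier = nltk.NaiveBayesClassifier.train(
    [(tag_features(tags), label) for tags, label in train])

print(classifier.classify(tag_features(['MD', 'PRP', 'VB'])))  # -> 'question'
```

In a real pipeline you would obtain each sentence's tag sequence with `[t for w, t in nltk.pos_tag(sent)]` as shown above and feed it through the same feature function.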

char bugs answered Sep 21 '22 03:09