How do I do word Stemming or Lemmatization?

1 Answers

If you know Python, The Natural Language Toolkit (NLTK) has a very powerful lemmatizer that makes use of WordNet.

Note that if you are using this lemmatizer for the first time, you must download the corpus prior to using it. This can be done by:

>>> import nltk >>> nltk.download('wordnet')

You only have to do this once. Assuming that you have now downloaded the corpus, it works like this:

>>> from nltk.stem.wordnet import WordNetLemmatizer >>> lmtzr = WordNetLemmatizer() >>> lmtzr.lemmatize('cars') 'car' >>> lmtzr.lemmatize('feet') 'foot' >>> lmtzr.lemmatize('people') 'people' >>> lmtzr.lemmatize('fantasized','v') 'fantasize'

There are other lemmatizers in the nltk.stem module, but I haven't tried them myself.

155

answered Sep 19 '22 13:09

theycallmemorty

Related questions
                            
                                spacy Can't find model 'en_core_web_sm' on windows 10 and Python 3.5.3 :: Anaconda custom (64-bit)
                            
                                Any tutorials for developing chatbots? [closed]
                            
                                Fuzzy string search library in Java [closed]
                            
                                What are the major differences and benefits of Porter and Lancaster Stemming algorithms? [closed]
                            
                                Stemmers vs Lemmatizers
                            
                                Practical examples of NLTK use [closed]
                            
                                Fuzzy String Comparison
                            
                                Ordinal numbers replacement
                            
                                Stopword removal with NLTK
                            
                                Calculate cosine similarity given 2 sentence strings
                            
                                Creating a new corpus with NLTK
                            
                                Sentiment analysis for Twitter in Python [closed]
                            
                                Is there a good natural language processing library [closed]
                            
                                How to config nltk data directory from code?
                            
                                How to train the Stanford Parser with Genia Corpus?
                            
                                How to use Stanford Parser in NLTK using Python
                            
                                What does Keras Tokenizer method exactly do?
                            
                                How can I correctly prefix a word with "a" and "an"?
                            
                                Understanding min_df and max_df in scikit CountVectorizer
                            
                                word2vec: negative sampling (in layman term)?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do I do word Stemming or Lemmatization?

Tags:

nlp

lemmatization

stemming

manixrock

People also ask

1 Answers

theycallmemorty

Recent Activity

Donate For Us