Is there a corpus of English words in nltk?

Tags:

nltk

Is there any way to get a list of English words from the Python nltk library? I tried to find one, but the only thing I found is wordnet from nltk.corpus. Based on the documentation, it does not do what I need (it finds synonyms for a word).

I know how to build such a word list myself (this answer covers it in detail), so I am interested in whether I can do this using only the nltk library.

239

asked Feb 05 '15 08:02

Salvador Dali

1 Answer

Yes, from nltk.corpus import words

And check using:

>>> "fine" in words.words()
True

Reference: Section 4.1 (Wordlist Corpora), chapter 2 of Natural Language Processing with Python.
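For repeated lookups, a minimal sketch along these lines may help. It assumes the `words` corpus may not be downloaded yet, so it fetches it on first use. `words.words()` returns a plain list, so each `in` check scans it linearly; building a set once makes later checks fast. The helper names `load_english_words` and `is_english_word` are hypothetical, not part of nltk, and lowercasing the vocabulary is a design choice of this sketch rather than something the corpus requires.

```python
def load_english_words():
    """Return the NLTK `words` corpus as a set of lowercase strings."""
    import nltk  # imported lazily so is_english_word works without NLTK

    try:
        # Raises LookupError if the corpus has not been downloaded yet.
        nltk.data.find("corpora/words")
    except LookupError:
        nltk.download("words")  # one-time download of the word list

    from nltk.corpus import words

    # Lowercase everything so lookups can be case-insensitive (assumption
    # of this sketch; the corpus itself also contains capitalized entries).
    return {w.lower() for w in words.words()}


def is_english_word(word, vocabulary):
    """Case-insensitive membership test against a prepared vocabulary set."""
    return word.lower() in vocabulary


if __name__ == "__main__":
    vocabulary = load_english_words()
    print(is_english_word("fine", vocabulary))
```

Passing the vocabulary in as an argument keeps the lookup independent of nltk, so the same check works with any other word set.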

answered Sep 22 '22 15:09

axiom

