nltk.word_tokenize() giving AttributeError: 'module' object has no attribute 'defaultdict'

Tags:

I am new to nltk. I was trying some basics.

import nltk
nltk.word_tokenize("Tokenize me")

gives me this following error

Traceback (most recent call last):
File "<pyshell#27>", line 1, in <module>
nltk.word_tokenize("hi im no onee")
File "C:\Python27\lib\site-packages\nltk\tokenize\__init__.py", line 101, in word_tokenize
return [token for sent in sent_tokenize(text, language)
File "C:\Python27\lib\site-packages\nltk\tokenize\__init__.py", line 85, in sent_tokenize
tokenizer = load('tokenizers/punkt/{0}.pickle'.format(language))
File "C:\Python27\lib\site-packages\nltk\data.py", line 786, in load
resource_val = pickle.load(opened_resource)
AttributeError: 'module' object has no attribute 'defaultdict'

Please someone help. Please tell me how to fix this error.

635

asked Jul 08 '15 14:07

Kantajit

1 Answers

I just checked it on my system.

Fix:

>> import nltk
>> nltk.download('all')

Then everything worked fine.

>> import nltk
>> nltk.word_tokenize("Tokenize me")
['Tokenize', 'me']

105

answered Sep 21 '22 23:09

Manoj

Related questions
                            
                                Extract Dates and events associated with the date from Text corpus
                            
                                Using my own corpus instead of movie_reviews corpus for Classification in NLTK
                            
                                NLTK - nltk.tokenize.RegexpTokenizer - regex not working as expected
                            
                                Using British National Corpus in NLTK
                            
                                How do I calculate the shortest path (geodesic) distance between two adjectives in WordNet using Python NLTK?
                            
                                nltk data fails to install on Ubuntu 14.04 of AWS instance type c4.xlarge
                            
                                Trying to use MEGAM as an NLTK ClassifierBasedPOSTagger?
                            
                                Removing punctuation/numbers from text problem
                            
                                which similarity function of nltk.corpus.wordnet is Appropriate for find similarity of two words?
                            
                                Using WordNet to determine semantic similarity between two texts?
                            
                                Sentiment analysis for sentences- positive, negative and neutral
                            
                                NLP - When to lowercase text during preprocessing
                            
                                Tokenizing large (>70MB) TXT file using Python NLTK. Concatenation & write data to stream errors
                            
                                Quick NLTK parse into syntax tree
                            
                                babelize_shell() not working in NLTK package
                            
                                Extracting the person names in the named entity recognition in NLP using Python
                            
                                dispersion_plot not working inspite of installing matplotlib
                            
                                No such file or directory 'nltk_data/corpora/stopwords/English' when using colab
                            
                                How nltk.TweetTokenizer different from nltk.word_tokenize?
                            
                                How to create the negative of a sentence in nltk

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

nltk.word_tokenize() giving AttributeError: 'module' object has no attribute 'defaultdict'

Tags:

defaultdict

attributeerror

nltk

Kantajit

People also ask

1 Answers

Manoj

Recent Activity

Donate For Us