Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

NLTK Lookup Error

While running a Python script using NLTK I got this:

Traceback (most recent call last):   File "cpicklesave.py", line 56, in <module>     pos = nltk.pos_tag(words)   File "/usr/lib/python2.7/site-packages/nltk/tag/__init__.py", line 110, in pos_tag     tagger = PerceptronTagger()   File "/usr/lib/python2.7/site-packages/nltk/tag/perceptron.py", line 140, in __init__     AP_MODEL_LOC = str(find('taggers/averaged_perceptron_tagger/'+PICKLE))   File "/usr/lib/python2.7/site-packages/nltk/data.py", line 641, in find     raise LookupError(resource_not_found) LookupError: **********************************************************************   Resource u'taggers/averaged_perceptron_tagger/averaged_perceptro   n_tagger.pickle' not found.  Please use the NLTK Downloader to   obtain the resource:  >>> nltk.download()   Searched in:     - '/root/nltk_data'     - '/usr/share/nltk_data'     - '/usr/local/share/nltk_data'     - '/usr/lib/nltk_data'     - '/usr/local/lib/nltk_data' ********************************************************************** 

Can anyone explain the problem?

like image 996
Shiv Shankar Avatar asked Mar 08 '16 07:03

Shiv Shankar


People also ask

What is NLTK Download (' Punkt ')?

'] punkt is the required package for tokenization. Hence you may download it using nltk download manager or download it programmatically using nltk. download('punkt') .

What is NLTK Averaged_perceptron_tagger?

punkt is used for tokenising sentences and averaged_perceptron_tagger is used for tagging words with their parts of speech (POS). We also need to set the add this directory to the NLTK data path. import os import nltk # Create NLTK data directory NLTK_DATA_DIR = './nltk_data' if not os. path. exists(NLTK_DATA_DIR): os.


1 Answers

Use

>>> nltk.download() 

to install the missing module (the Perceptron Tagger).

(check also the answers to Failed loading english.pickle with nltk.data.load)

like image 107
user2314737 Avatar answered Sep 23 '22 12:09

user2314737