While running a Python script using NLTK I got this:
Traceback (most recent call last): File "cpicklesave.py", line 56, in <module> pos = nltk.pos_tag(words) File "/usr/lib/python2.7/site-packages/nltk/tag/__init__.py", line 110, in pos_tag tagger = PerceptronTagger() File "/usr/lib/python2.7/site-packages/nltk/tag/perceptron.py", line 140, in __init__ AP_MODEL_LOC = str(find('taggers/averaged_perceptron_tagger/'+PICKLE)) File "/usr/lib/python2.7/site-packages/nltk/data.py", line 641, in find raise LookupError(resource_not_found) LookupError: ********************************************************************** Resource u'taggers/averaged_perceptron_tagger/averaged_perceptro n_tagger.pickle' not found. Please use the NLTK Downloader to obtain the resource: >>> nltk.download() Searched in: - '/root/nltk_data' - '/usr/share/nltk_data' - '/usr/local/share/nltk_data' - '/usr/lib/nltk_data' - '/usr/local/lib/nltk_data' **********************************************************************
Can anyone explain the problem?
'] punkt is the required package for tokenization. Hence you may download it using nltk download manager or download it programmatically using nltk. download('punkt') .
punkt is used for tokenising sentences and averaged_perceptron_tagger is used for tagging words with their parts of speech (POS). We also need to set the add this directory to the NLTK data path. import os import nltk # Create NLTK data directory NLTK_DATA_DIR = './nltk_data' if not os. path. exists(NLTK_DATA_DIR): os.
Use
>>> nltk.download()
to install the missing module (the Perceptron Tagger).
(check also the answers to Failed loading english.pickle with nltk.data.load)
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With