Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

PyCharm can't find Spacy Model 'en'

I am trying to load a NLP model 'en' from SpaCy in my PyCharm and I am using Python 2.7 .
My code to load the 'en' model is nlp = spacy.load('en', disable=['parser', 'ner'])
However, I received the following error
IOError: [E050] Can't find model 'en'. It doesn't seem to be a shortcut link, a Python package or a valid path to a data directory.
I then realised that I didn't download the model, so I used the terminal provided in PyCharm to download the model, I used python -m spacy download en

This was the following output:

Requirement already satisfied: en_core_web_sm==2.0.0 from https://github.com/explosion/spacy-models/releases/download/en_core_web_sm-2.0.0/en_core_web_sm-2.0.0. tar.gz#egg=en_core_web_sm==2.0.0 in c:\python27\lib\site-packages
You are using pip version 9.0.1, however version 18.0 is available.
You should consider upgrading via the 'python -m pip install --upgrade pip' command.
You do not have sufficient privilege to perform this operation.

Linking successful C:\Python27\lib\site-packages\en_core_web_sm --> C:\Python27\lib\site-packages\spacy\data\en

You can now load the model via spacy.load('en')

So I am quite confused... I presume that I was unable to download the 'en' model as I do not have enough privileges to do so, but how was the linking successful?
Upon seeing this message, I tried running my Python file again ( since the terminal stated that linking was successful) but the initial error popped out again.

Has anybody encountered this problem before, or knows how to solve this error? How am I able to 'escalate' my privileges in PyCharm terminal so that I will be able to download the model?

like image 914
festivezine Avatar asked Oct 06 '18 09:10

festivezine


2 Answers

I don't know if it is still relevant, but I run into it too. The module was loaded well on Jupyter Notebook but not in my PyCharm. To solve it go to the interpreter of your project in PyCharm (using ctrl + alt + s). See the full path to the interpreter you are using. Then use it the terminal in such a way:

FULL_PATH_TO_PYTHON_INTERPRTER -m spacy download en

It should work now from your PyCharm.

like image 175
Eli Borodach Avatar answered Oct 14 '22 19:10

Eli Borodach


Spacy explains several ways to download a model : https://spacy.io/usage/models#download

Using python -m

# Download best-matching version of specific model for your spaCy installation
python -m spacy download en_core_web_sm

# Out-of-the-box: download best-matching default model and create shortcut link
python -m spacy download en

# Download exact model version (doesn't create shortcut link)
python -m spacy download en_core_web_sm-2.2.0 --direct

Using pip

# With external URL
pip install https://github.com/explosion/spacy-models/releases/download/en_core_web_sm-2.2.0/en_core_web_sm-2.2.0.tar.gz

# With local file
pip install /Users/you/en_core_web_sm-2.2.0.tar.gz

Using manual download

https://spacy.io/usage/models#download-manual

Now how to download it using PyCharm ?

I tried to do it by installing an URL package Project Interpreter in a virtual environment (venv) :

https://github.com/explosion/spacy-models/releases/download/en_core_web_sm-2.2.5/en_core_web_sm-2.2.5.tar.gz

But Pycharm failed to install the tar directly.


Solution : Finally I just add the github URL of the tar in the requirements.txt, then PyCharm will install it for you.

Add this below in requirements.txt :

# spacy
spacy
# spacy model
https://github.com/explosion/spacy-models/releases/download/en_core_web_sm-2.2.5/en_core_web_sm-2.2.5.tar.gz

You can do it for all the models on the github release page : https://github.com/explosion/spacy-models/releases/

like image 28
Adrien Arcuri Avatar answered Oct 14 '22 19:10

Adrien Arcuri