I have trained my own word2vec model in gensim and I am trying to load that model in spacy. First, I need to save it in my disk and then try to load an init-model in spacy but unable to figure out exactly how. <pre class="prettyprint"><code>gensimmodel Out[252]: <gensim.models.word2vec.Word2Vec at 0x110b24b70> import spacy spacy.load(gensimmodel) OSError: [E050] Can't find model 'Word2Vec(vocab=250, size=1000, alpha=0.025)'. It doesn't seem to be a shortcut link, a Python package or a valid path to a data directory. </code></pre>

Train and save your model in plain-text format: <pre class="prettyprint"><code>from gensim.test.utils import common_texts, get_tmpfile from gensim.models import Word2Vec path = get_tmpfile("./data/word2vec.model") model = Word2Vec(common_texts, size=100, window=5, min_count=1, workers=4) model.wv.save_word2vec_format("./data/word2vec.txt") </code></pre> Gzip the text file: <pre class="prettyprint"><code>gzip word2vec.txt </code></pre> Which produces a <code>word2vec.txt.gz</code> file. Run the following command: <pre class="prettyprint"><code>python -m spacy init-model en ./data/spacy.word2vec.model --vectors-loc word2vec.txt.gz </code></pre> Load the vectors using: <pre class="prettyprint"><code>nlp = spacy.load('./data/spacy.word2vec.model/') </code></pre>

In spacy, how to use your own word2vec model created in gensim?

Tags:

model

gensim

word2vec

spacy

I have trained my own word2vec model in gensim and I am trying to load that model in spacy. First, I need to save it in my disk and then try to load an init-model in spacy but unable to figure out exactly how.

gensimmodel
Out[252]:
<gensim.models.word2vec.Word2Vec at 0x110b24b70>

import spacy
spacy.load(gensimmodel)

OSError: [E050] Can't find model 'Word2Vec(vocab=250, size=1000, alpha=0.025)'. It doesn't seem to be a shortcut link, a Python package or a valid path to a data directory.

901

asked May 22 '18 11:05

Subigya Upadhyay

2 Answers

Train and save your model in plain-text format:

from gensim.test.utils import common_texts, get_tmpfile
from gensim.models import Word2Vec

path = get_tmpfile("./data/word2vec.model")

model = Word2Vec(common_texts, size=100, window=5, min_count=1, workers=4)
model.wv.save_word2vec_format("./data/word2vec.txt")

Gzip the text file:

gzip word2vec.txt

Which produces a word2vec.txt.gz file.

Run the following command:

python -m spacy init-model en ./data/spacy.word2vec.model --vectors-loc word2vec.txt.gz

Load the vectors using:

nlp = spacy.load('./data/spacy.word2vec.model/')

answered Nov 03 '22 00:11

hbot

As explained here, you can import custom word vectors that trained using Gensim, Fast Text, or Tomas Mikolov's original word2vec implementation, by creating a model using:

wget https://s3-us-west-1.amazonaws.com/fasttext-vectors/word-vectors-v2/cc.la.300.vec.gz
python -m spacy init-model en your_model --vectors-loc cc.la.300.vec.gz

then you can load you model, nlp = spacy.load('your_model') and use it!

Also see the similar question that answered here.

answered Nov 03 '22 00:11

Ali Zarezade

Related questions
                            
                                keras: Use one model output as another model input
                            
                                VueJS - How to make models and collections?
                            
                                Import data from excel spreadsheet to django model
                            
                                How to use full_clean() for data validation before saving in Django 1.5 gracefully?
                            
                                This expression is not callable. Type 'Number' has no call signatures
                            
                                Rails: How do self-referential has_many models work?
                            
                                Core Data - Discard changes
                            
                                Passing a model object to a RedirectToAction without polluting the URL?
                            
                                Backbone.js model.destroy() not sending DELETE request
                            
                                Eager loading in deep level nested association
                            
                                How to validate uniqueness in Rails 3 Model if I want to check if there is a 2-field combination?
                            
                                Laravel ClassLoader trying to load an old version of my model
                            
                                Ember.js: Calculate the sum of a property of all child models
                            
                                Class Modeling alternatives for Objective-C
                            
                                Laravel Generate Migration from existing Model
                            
                                Rails validation from controller
                            
                                Reputation and point system models
                            
                                rails has_one of a has_many association
                            
                                What is the standard way to organize Android code in project [closed]
                            
                                What is the difference between a Multi-table inherited model and a simple One-to-one relationship between the same two models?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With