Gensim word2vec in python3 missing vocab

I'm using the gensim implementation of Word2Vec. I have the following code snippet:

print('training model')
model = Word2Vec(Sentences(start, end))
print('trained model:', model)
print('vocab:', model.vocab.keys())

When I run this in Python 2, it runs as expected. The final print shows all the words in the vocabulary.

However, if I run it in Python 3, I get an error:

trained model: Word2Vec(vocab=102, size=100, alpha=0.025)
Traceback (most recent call last):
  File "learn.py", line 58, in <module>
    train(to_datetime('-4h'), to_datetime('now'), 'model.out')
  File "learn.py", line 23, in train
    print('vocab:', model.vocab.keys())
AttributeError: 'Word2Vec' object has no attribute 'vocab'

What is going on? Is gensim's Word2Vec not compatible with Python 3?

asked Feb 28 '17 19:02

Sam Lee

1 Answer

Are you using the same version of gensim in both places? The error depends on the gensim version, not on the Python version. Gensim 1.0.0 moved vocab to a helper object (model.wv), so whereas in pre-1.0.0 versions of gensim (in Python 2 or 3) you can use:

model.vocab

...in gensim 1.0.0+ (in Python 2 or 3) you should instead use:

model.wv.vocab
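
If the same script has to run against both older and newer gensim installs, one option is a small helper that looks for the vocabulary wherever the installed version keeps it. This is only a sketch: vocab_words is a hypothetical name, not part of gensim, and the key_to_index lookup rests on the assumption that gensim 4.0+ exposes the word-to-index mapping under that name instead of vocab.

```python
def vocab_words(model):
    """Return the vocabulary words of a trained Word2Vec-like model as a list.

    Hypothetical helper (not part of gensim). It only inspects attributes,
    so it does not need to import gensim itself.
    """
    # gensim 1.0.0+ keeps the vocabulary on the helper object model.wv;
    # pre-1.0.0 releases keep it directly on the model.
    holder = getattr(model, "wv", model)

    # Assumption: gensim 4.0+ names the mapping `key_to_index`,
    # while earlier releases name it `vocab`. Try the newer name first.
    for name in ("key_to_index", "vocab"):
        mapping = getattr(holder, name, None)
        if mapping is not None:
            return list(mapping.keys())

    raise AttributeError("no vocabulary mapping found on this model")
```

With that in place, the failing line from the question could be written as print('vocab:', vocab_words(model)). To confirm that the two environments really differ, printing gensim.__version__ under each interpreter should show which release each one has installed.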

answered Oct 04 '22 01:10

gojomo
