 

Implement word2vec in Keras

I would like to implement the word2vec algorithm in Keras. Is this possible? How can I fit the model? Should I use a custom loss function?

asked Oct 25 '16 by András


People also ask

How do you implement Word2Vec?

To implement Word2Vec, there are two flavors to choose from: Continuous Bag-of-Words (CBOW) and continuous Skip-gram (SG). In short, CBOW attempts to predict the target word from its neighbouring context words, whereas Skip-gram does the opposite and predicts the context words from the target word.
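To make the difference concrete, here is a minimal sketch in plain Python (the sentence and window size are purely illustrative) of the training pairs each flavor produces:

```python
# Sliding a window over a tokenized sentence to build training pairs.
sentence = "the quick brown fox jumps over the lazy dog".split()
window = 2  # context words on each side of the target

cbow_pairs = []      # (context words -> target word)
skipgram_pairs = []  # (target word -> each context word)

for i, target in enumerate(sentence):
    context = [sentence[j]
               for j in range(max(0, i - window),
                              min(len(sentence), i + window + 1))
               if j != i]
    cbow_pairs.append((context, target))
    skipgram_pairs.extend((target, c) for c in context)

print(cbow_pairs[2])       # (['the', 'quick', 'fox', 'jumps'], 'brown')
print(skipgram_pairs[:4])  # ('the', 'quick'), ('the', 'brown'), ...
```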

Is BERT better than Word2Vec?

Word2Vec generates the same single vector for the word bank no matter which sentence it appears in. BERT, in contrast, generates two different vectors for bank when it is used in two different contexts: one vector will be similar to words like money and cash, while the other will be similar to words like beach and coast.


1 Answer

Is this possible?

You've already answered it yourself: yes. In addition to word2veckeras, which uses gensim, here's another CBOW implementation with no extra dependencies (for the record, I'm not affiliated with that repo). You can use both as examples.
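The core CBOW model is small enough to sketch directly. This is a minimal illustration, not the code from either repo; the vocabulary size, embedding dimension, and window are placeholder values, and it uses the current tf.keras functional API. The idea: average the context-word embeddings, then predict the center word with a softmax.

```python
from tensorflow import keras
from tensorflow.keras import layers

vocab_size = 10000  # assumed vocabulary size
embed_dim = 100     # assumed embedding dimensionality
window = 2          # context words on each side, so 2 * window inputs

context_in = keras.Input(shape=(2 * window,), dtype="int32")
embedded = layers.Embedding(vocab_size, embed_dim)(context_in)  # (batch, 2w, d)
averaged = layers.GlobalAveragePooling1D()(embedded)            # (batch, d)
output = layers.Dense(vocab_size, activation="softmax")(averaged)

model = keras.Model(context_in, output)
model.compile(optimizer="adam", loss="categorical_crossentropy")
```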

How can I fit the model?

Since the training data is a large corpus of sentences, the most convenient method is model.fit_generator, which "fits the model on data generated batch-by-batch by a Python generator". The generator runs indefinitely, yielding (context, target) CBOW tuples (or (target, context) Skip-gram pairs), and you bound the training by specifying samples_per_epoch and nb_epoch. This way you decouple the sentence analysis (tokenization, word index table, sliding window, etc.) from the actual Keras model, and save a lot of resources.
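A minimal sketch of such a generator, feeding the CBOW model above; corpus (an iterable of token lists), word_index, and the reserved padding id 0 are assumptions of this example, not part of any library:

```python
import numpy as np

def cbow_generator(corpus, word_index, vocab_size, window=2, batch_size=128):
    """Yield (context ids, one-hot target) batches forever; all sentence
    analysis (index lookup, sliding window) happens here, not in the model."""
    contexts, targets = [], []
    while True:  # loop forever; training length is bounded by the fit call
        for sentence in corpus:
            ids = [word_index[w] for w in sentence if w in word_index]
            for i, target in enumerate(ids):
                ctx = [ids[j]
                       for j in range(max(0, i - window),
                                      min(len(ids), i + window + 1))
                       if j != i]
                ctx += [0] * (2 * window - len(ctx))  # pad short contexts
                contexts.append(ctx)
                targets.append(target)
                if len(contexts) == batch_size:
                    x = np.array(contexts, dtype="int32")
                    y = np.zeros((batch_size, vocab_size), dtype="float32")
                    y[np.arange(batch_size), targets] = 1.0  # one-hot targets
                    yield x, y
                    contexts, targets = [], []

# Keras 1 API, as described above:
#   model.fit_generator(gen, samples_per_epoch=100000, nb_epoch=5)
# Current Keras accepts the generator in fit() directly:
#   model.fit(gen, steps_per_epoch=1000, epochs=5)
```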

Should I use custom loss function?

CBOW minimizes the distance between the predicted and true distributions of the center word, so in the simplest form categorical_crossentropy will do. If you implement negative sampling, which is a bit more complex yet much more efficient, the loss changes to binary_crossentropy. A custom loss function is unnecessary.
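For illustration, here is a sketch of the negative-sampling variant (sizes are placeholders): each (target, context) pair is scored with a dot product of two embeddings and trained as a binary classifier, which is exactly where binary_crossentropy comes in. Keras ships a helper, keras.preprocessing.sequence.skipgrams, that builds the labeled pairs.

```python
from tensorflow import keras
from tensorflow.keras import layers

vocab_size = 10000  # assumed
embed_dim = 100     # assumed

target_in = keras.Input(shape=(1,), dtype="int32")
context_in = keras.Input(shape=(1,), dtype="int32")

target_emb = layers.Embedding(vocab_size, embed_dim)(target_in)    # (batch, 1, d)
context_emb = layers.Embedding(vocab_size, embed_dim)(context_in)  # (batch, 1, d)

score = layers.Dot(axes=2)([target_emb, context_emb])  # (batch, 1, 1)
score = layers.Flatten()(score)
output = layers.Activation("sigmoid")(score)           # P(pair is genuine)

model = keras.Model([target_in, context_in], output)
model.compile(optimizer="adam", loss="binary_crossentropy")

# Pair generation: label 1 for observed (target, context) pairs,
# label 0 for sampled negatives.
# pairs, labels = keras.preprocessing.sequence.skipgrams(
#     sequence, vocabulary_size=vocab_size, window_size=2, negative_samples=1.0)
```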

For anyone interested in the details of the math and the probabilistic model, I highly recommend Stanford's CS224D class. Here are the lecture notes on word2vec, CBOW and Skip-Gram.

Another useful reference: a word2vec implementation in pure numpy and C.

answered Oct 19 '22 by Maxim