I am playing around with <code>FastText</code>, https://pypi.python.org/pypi/fasttext,which is quite similar to <code>Word2Vec</code>. Since it seems to be a pretty new library with not to many built in functions yet, I was wondering how to extract morphological similar words. For eg: <code>model.similar_word("dog")</code> -> dogs. But there is no function built-in. If I type <code>model["dog"]</code> I only get the vector, that might be used to compare cosine similarity. <code>model.cosine_similarity(model["dog"], model["dogs"]])</code>. Do I have to make some sort of loop and do <code>cosine_similarity</code> on all possible pairs in a text? That would take time ...!!!

You can install and import gensim library and then use gensim library to extract most similar words from the model that you downloaded from FastText. Use this: <pre class="prettyprint"><code>import gensim model = gensim.models.KeyedVectors.load_word2vec_format('model.vec') similar = model.most_similar(positive=['man'],topn=10) </code></pre> And by topn parameter you get the top 10 most similar words.

How to find similar words with FastText?

Tags:

python

nlp

word2vec

fasttext

I am playing around with FastText, https://pypi.python.org/pypi/fasttext,which is quite similar to Word2Vec. Since it seems to be a pretty new library with not to many built in functions yet, I was wondering how to extract morphological similar words.

For eg: model.similar_word("dog") -> dogs. But there is no function built-in.

If I type model["dog"]

I only get the vector, that might be used to compare cosine similarity. model.cosine_similarity(model["dog"], model["dogs"]]).

Do I have to make some sort of loop and do cosine_similarity on all possible pairs in a text? That would take time ...!!!

843

asked Feb 13 '17 14:02

Isbister

1 Answers

You can install and import gensim library and then use gensim library to extract most similar words from the model that you downloaded from FastText.

Use this:

import gensim
model = gensim.models.KeyedVectors.load_word2vec_format('model.vec')
similar = model.most_similar(positive=['man'],topn=10)

And by topn parameter you get the top 10 most similar words.

answered Sep 24 '22 08:09

Md Rashad Al Hasan Rony

Related questions
                            
                                How to execute Python script from Java (via command line)?
                            
                                Numpy chain comparison with two predicates
                            
                                How to disable Jinja2 for sections of template with {}?
                            
                                Add item into array if not already in array
                            
                                Adding +1 to a variable inside a function [duplicate]
                            
                                using regular expressions to exclude characters in a string search?
                            
                                When, if ever, to use the 'is' keyword in Python?
                            
                                How to get the MySQL type of error with PyMySQL?
                            
                                How to do JSON handler in Django
                            
                                numpy.fft() what is the return value amplitude + phase shift OR angle?
                            
                                Returning the URL's as a list from a YouTube search query [closed]
                            
                                django.core.exceptions.ImproperlyConfigured: The SECRET_KEY setting must not be empty
                            
                                Kerberos installation error, error: Setup script exited with error: command 'i686-linux-gnu-gcc' failed with exit status 1
                            
                                how to pass multiple parameters to class during initialization
                            
                                how to change image illumination in opencv python
                            
                                The current URL, app/, didn't match any of these
                            
                                How to install gnu gettext (>0.15) on windows? So I can produce .po/.mo files in Django
                            
                                Managing contents of requirements.txt for a Python virtual environment
                            
                                how to install python3-tk in centos?
                            
                                Pandas cumulative count [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With