I have been experimenting with the doc2vec module for some time now. I can train my model and use it to find documents similar to a given document, as follows:
from gensim.models.doc2vec import Doc2Vec
import re

modelloaded = Doc2Vec.load("model_all_doc_dm_1")
st = 'long description of a document as string'
doc = re.sub('[^a-zA-Z]', ' ', st).lower().split()
new_doc_vec = modelloaded.infer_vector(doc)
modelloaded.docvecs.most_similar([new_doc_vec])
This works well and gives me 10 results. Is there a way to get more than 10 results, or is that the limit?
You could also try tweaking vector_size (default 100) to something smaller or larger. If your vocabulary is limited, condensing the vectors to a smaller size may give better results, since you are mapping a small vocabulary into a lower-dimensional space. The vector maps each document to a point in 100-dimensional space; a size of 200 maps a document to a point in 200-dimensional space. The more dimensions, the more differentiation between documents.
While Word2Vec computes a feature vector for every word in the corpus, Doc2Vec computes a feature vector for every document in the corpus. The Doc2Vec model is based on Word2Vec, with one addition: another vector (the paragraph ID) is added to the input.
I found it:

modelloaded.docvecs.most_similar([new_doc_vec], topn=N)

The topn=N parameter can be used to get more than 10 results.
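For intuition, what `most_similar(..., topn=N)` does can be sketched in pure Python: rank stored vectors by cosine similarity to a query vector and return the top N. The vectors and tags below are toy data, not real Doc2Vec output.

```python
# Toy reimplementation of most_similar with a topn parameter.
import math

def cosine(a, b):
    # Cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def most_similar(query, docvecs, topn=10):
    # Score every stored document vector, sort descending, keep topn.
    sims = [(tag, cosine(query, vec)) for tag, vec in docvecs.items()]
    return sorted(sims, key=lambda t: t[1], reverse=True)[:topn]

docvecs = {
    "doc0": [1.0, 0.0, 0.0],
    "doc1": [0.9, 0.1, 0.0],
    "doc2": [0.0, 1.0, 0.0],
    "doc3": [0.0, 0.0, 1.0],
}
print(most_similar([1.0, 0.05, 0.0], docvecs, topn=2))
# Returns the 2 most similar tags with their scores, e.g. doc0 then doc1.
```

So the only hard limit on the number of results is the number of document vectors in the model; topn just sets how many of the ranked candidates are returned.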