I know that after training an LDA model in gensim, we can get the topics for an unseen document with:
from gensim.models import LdaModel

lda = LdaModel(corpus, num_topics=10)
doc_lda = lda[doc_bow]  # topic distribution for the unseen document
But what about the documents that were already used for training? Is there a way to get the topics for a document in the training corpus without treating it like a new document?
No.
Information from individual documents is distilled into the model, then forgotten.
No per-document information is kept (more generally, nothing that would require O(#docs) memory is kept).