
Topic modeling on short texts Python

I want to do topic modeling on short texts. I did some research on LDA and found that it doesn't work well on short texts. What methods would be better, and do they have Python implementations?

Sri Test asked Jun 03 '20 14:06


3 Answers

You can try Short Text Topic Modelling (STTM): see https://www.groundai.com/project/sttm-a-tool-for-short-text-topic-modeling/1 for a description (code available at https://github.com/qiang2100/STTM). It combines state-of-the-art short-text algorithms with traditional topic models for long text, so both can conveniently be applied to short texts.

For a more specialised library, try lda2vec-tf, which combines word vectors with LDA topic vectors. It is a fork of the original lda2vec that improves on it and gives better results than the original library.

red_mouse_coder answered Oct 24 '22 22:10


Besides GSDMM, there is also a Python implementation of the biterm topic model (BTM) for short-text topic modeling.
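To give a rough idea of what a "biterm" is: it is an unordered pair of words that co-occur in the same short text, and BTM models these pairs over the whole corpus instead of per-document word counts. The sketch below only shows the pair-extraction step, not the topic model itself; the function name `extract_biterms` and the sample documents are made up for illustration and are not the API of any particular biterm package.

```python
from collections import Counter
from itertools import combinations


def extract_biterms(tokens):
    """Return every unordered word pair (biterm) in one short text.

    The whole text is treated as a single context window, which is the
    usual choice for very short documents. Repeated tokens are not
    deduplicated here, so a word that occurs twice yields a (w, w) pair.
    """
    return [tuple(sorted(pair)) for pair in combinations(tokens, 2)]


# Illustrative, already-tokenized short texts (hypothetical data).
docs = [
    ["apple", "iphone", "launch"],
    ["apple", "pie", "recipe"],
]

# Corpus-level biterm counts: this is the input a biterm model is fit on.
biterm_counts = Counter(b for doc in docs for b in extract_biterms(doc))
```

Because the pairs are pooled across all documents, the model sees far more co-occurrence evidence than any single short text provides, which is why this approach tends to cope better with sparsity than plain LDA.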

chefhose answered Oct 24 '22 21:10


The only Python implementation of short-text topic modeling is GSDMM. Unfortunately, most of the others are written in Java.
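For anyone wondering what GSDMM actually does: it assumes each short text belongs to exactly one topic (cluster) and resamples that assignment with collapsed Gibbs sampling over a Dirichlet multinomial mixture. Below is a minimal pure-Python sketch of that idea, not the code of the GSDMM package; the function name `gsdmm`, its parameters, the default values, and the sample documents are all assumptions for illustration.

```python
import random
from collections import Counter


def gsdmm(docs, K=8, alpha=0.1, beta=0.1, n_iters=30, seed=0):
    """Cluster tokenized short texts; returns one cluster index per doc.

    K is an upper bound on the number of clusters; some may end up empty.
    """
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})
    m = [0] * K                          # documents per cluster
    n = [0] * K                          # tokens per cluster
    nw = [Counter() for _ in range(K)]   # word counts per cluster
    z = []                               # cluster label of each document

    # Random initial assignment.
    for doc in docs:
        k = rng.randrange(K)
        z.append(k)
        m[k] += 1
        n[k] += len(doc)
        nw[k].update(doc)

    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            # Remove the document from its current cluster.
            k = z[d]
            m[k] -= 1
            n[k] -= len(doc)
            nw[k].subtract(doc)

            # Unnormalized probability of each cluster for this document.
            counts = Counter(doc)
            weights = []
            for c in range(K):
                p = m[c] + alpha
                for w, cnt in counts.items():
                    for j in range(cnt):
                        p *= nw[c][w] + beta + j
                for i in range(len(doc)):
                    p /= n[c] + vocab_size * beta + i
                weights.append(p)

            # Sample a new cluster and add the document back.
            k = rng.choices(range(K), weights=weights)[0]
            z[d] = k
            m[k] += 1
            n[k] += len(doc)
            nw[k].update(doc)
    return z


# Illustrative, already-tokenized short texts (hypothetical data).
docs = [
    ["python", "pandas", "dataframe"],
    ["pandas", "dataframe", "merge"],
    ["football", "goal", "league"],
    ["league", "goal", "match"],
]
labels = gsdmm(docs, K=4)
```

The probabilities are multiplied directly, which is fine for short texts but would underflow on long documents; a real implementation would work in log space.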

Ilya Palachev answered Oct 24 '22 22:10