How to determine the number of topics for LDA?

I am new to LDA and want to use it in my work, but I have run into a problem.

To get the best performance, I want to estimate the best number of topics. After reading "Finding Scientific Topics", I understand that I can first compute log P(w|z) and then use the harmonic mean of a series of P(w|z) values to estimate P(w|T).

My question is: what does "a series of" mean here?
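
For concreteness, here is a minimal sketch of the harmonic-mean step described above. It assumes that "a series of" refers to the values of P(w|z) obtained from successive samples of z drawn by the Gibbs sampler for a fixed number of topics T, and that the log-likelihood of each sample is already available. The function name `log_harmonic_mean` and the input values are hypothetical, chosen only for illustration. The computation stays in log space because the raw likelihoods are far too small to represent directly.

```python
import math

def log_harmonic_mean(log_likelihoods):
    """Return the log of the harmonic mean of the likelihoods.

    log_likelihoods: one log P(w|z) value per sample of z (assumed input).
    Harmonic mean = S / sum_s(1 / P_s), so in log space:
        log S - logsumexp(-log P_s)
    """
    negated = [-ll for ll in log_likelihoods]
    m = max(negated)  # shift by the maximum to avoid overflow in exp()
    log_sum = m + math.log(sum(math.exp(x - m) for x in negated))
    return math.log(len(negated)) - log_sum

# Hypothetical log-likelihoods from three samples for one value of T.
samples = [-1000.0, -1002.0, -1001.0]
estimate = log_harmonic_mean(samples)
```

Repeating this for each candidate T and comparing the resulting estimates of log P(w|T) is one way to use the quantity for choosing the number of topics.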

asked Jul 02 '13 09:07

Chelsea Wang

2 Answers

Unfortunately, there is no hard science that yields the correct answer to your question. To the best of my knowledge, the hierarchical Dirichlet process (HDP) is quite possibly the best way to arrive at the optimal number of topics, because it infers the number of topics from the data instead of requiring you to fix it in advance.

If you are looking for deeper analyses, this paper on HDP reports the advantages of HDP in determining the number of groups.

answered Oct 03 '22 10:10

Chthonic Project

A reliable way is to compute the topic coherence for different numbers of topics and choose the model that gives the highest coherence. Sometimes, though, the model with the highest coherence will not fit the bill.
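
As a rough sketch of that selection loop, the snippet below scores candidate models with UMass coherence, which is one of several coherence measures and is used here only because it needs nothing beyond document co-occurrence counts. The toy `docs`, the `candidates` mapping from a topic count to each topic's top words, and all function names are hypothetical. In practice the top words would come from LDA models trained with each candidate number of topics.

```python
import math
from itertools import combinations

def umass_coherence(top_words, doc_sets):
    """UMass coherence of one topic (top_words ordered most probable first)."""
    score = 0.0
    for i, j in combinations(range(len(top_words)), 2):
        earlier, later = top_words[i], top_words[j]
        d_earlier = sum(1 for d in doc_sets if earlier in d)
        d_both = sum(1 for d in doc_sets if earlier in d and later in d)
        if d_earlier == 0:
            continue  # word absent from the corpus: pair is skipped
        score += math.log((d_both + 1) / d_earlier)
    return score

def pick_num_topics(candidates, docs):
    """Return (best_k, scores), where scores[k] is the mean topic coherence."""
    doc_sets = [set(d) for d in docs]
    scores = {
        k: sum(umass_coherence(t, doc_sets) for t in topics) / len(topics)
        for k, topics in candidates.items()
    }
    return max(scores, key=scores.get), scores

# Toy corpus and hypothetical top words for models with 2 and 3 topics.
docs = [
    ["cat", "dog", "pet"],
    ["dog", "pet", "vet"],
    ["stock", "market", "trade"],
    ["market", "trade", "price"],
]
candidates = {
    2: [["dog", "pet", "cat"], ["market", "trade", "stock"]],
    3: [["dog", "market", "cat"], ["pet", "trade", "vet"], ["stock", "price", "dog"]],
}
best_k, scores = pick_num_topics(candidates, docs)
```

Plotting `scores` against the number of topics, rather than only taking the maximum, makes it easier to see when a slightly lower-scoring model is still a reasonable choice.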

See this topic modeling example.

answered Oct 03 '22 11:10

Selva
