Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Which phrase extraction tool is the state of art now?

I know of the following open source tools, but I haven't found any comparisons of how good they are respectively. Tools with ready to use phrase extraction:

  • KEA
  • MAUI (http://code.google.com/p/maui-indexer/)
  • Dragon, xTract (http://dragon.ischool.drexel.edu/xtract.asp)
  • Lingpipe (http://alias-i.com/lingpipe/demos/tutorial/interestingPhrases/read-me.html)
  • Mahout (https://cwiki.apache.org/MAHOUT/collocations.html)
  • Anything else

Did anyone ever see such a comparison?

like image 589
yura Avatar asked Mar 24 '12 17:03

yura


1 Answers

MAUI outperforms KEA on my experiments. There is a comparison on unsupervised automatic key phrase extraction methods (Coling 2010 paper). But they don't analyse supervised methods, I'm planning to do that in a near future.

In addition, I've also explored a richer set of features which improved the performance of automatic Key Phrase Extraction which is still far from perfect. I might release the extended version of MAUI with those extensions next year.

Please read the following papers or email me more details:

Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization

Keyphrase Cloud Generation of Broadcast News

like image 196
Luis Avatar answered Sep 24 '22 23:09

Luis