
How to identify tags (key words) automatically from a given text?

It should behave like the Delicious toolbar for Firefox does: it lists suggested tags that the user can click.

The code should be able to find keywords for a given text. Can anyone recommend a good algorithm or open source project?

I found this post, but it is a bit too general for my specific need.

lkahtz asked Feb 03 '11 03:02





1 Answer

I think you're looking for one of these answers:

  • tag generation from a text content
  • How to extract common / significant phrases from a series of text entries
  • tag generation from a small text content (such as tweets)

In a nutshell, you're looking to extract terms (unigrams, or short phrases) from the text that somehow represent the concepts within it. One technique for scoring candidate terms is Pointwise Mutual Information (PMI), which measures how much more often two events occur together than they would if they were independent; it is illustrated with an example in the first two links. The Python NLTK framework, which already has a number of these algorithms built in, is probably a good starting point.
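To make the PMI idea concrete, here is a minimal sketch in plain Python. It is not the code from the linked answers, and it applies PMI to adjacent word pairs rather than to single words, which is one common way to use it. The helper names (`tokenize`, `pmi_bigrams`, `suggest_tags`) and the `min_count` threshold are my own assumptions for illustration. NLTK offers comparable scoring through `BigramCollocationFinder` and `BigramAssocMeasures.pmi` if you would rather not roll your own.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def pmi_bigrams(tokens, min_count=2):
    """Score adjacent word pairs by pointwise mutual information.

    PMI(x, y) = log2( P(x, y) / (P(x) * P(y)) )

    min_count is a hypothetical threshold: PMI is unreliable for pairs
    seen only once, so rare pairs are skipped.
    """
    unigram_counts = Counter(tokens)
    bigram_counts = Counter(zip(tokens, tokens[1:]))
    n_unigrams = len(tokens)
    n_bigrams = max(len(tokens) - 1, 1)

    scores = {}
    for (x, y), count in bigram_counts.items():
        if count < min_count:
            continue
        p_xy = count / n_bigrams
        p_x = unigram_counts[x] / n_unigrams
        p_y = unigram_counts[y] / n_unigrams
        scores[(x, y)] = math.log2(p_xy / (p_x * p_y))

    # Highest-scoring pairs first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


def suggest_tags(text, top_n=5, min_count=2):
    """Return up to top_n two-word phrases as candidate tags."""
    ranked = pmi_bigrams(tokenize(text), min_count=min_count)
    return [" ".join(pair) for pair, _score in ranked[:top_n]]


if __name__ == "__main__":
    sample = (
        "Machine learning is fun. Machine learning is hard. "
        "The cat is here. The dog is there."
    )
    print(suggest_tags(sample))
```

On real text you would probably also want to drop stop words and avoid pairing words across sentence boundaries; this sketch skips both to keep the scoring step visible.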

Good luck!

viksit answered Oct 12 '22 11:10
