Automatically pick tags from context using Python

Question

How can I pick tags from an article or a user's post using Python?

Is the following method ok?

Build a list of word frequency from the text and sort them.
Remove some common words and pick the top 10 words remained in the list as the tags.

If the above method is ok, what library can detect if which words are common, like "the, if, you, etc" and which are descriptive words?

ʞɔıu · Accepted Answer

Here's an article on removing stop words. The link to the stop word list in the article is broken but here's another one.

paprika · Answer

The Natural Language Toolkit offers a broad variety of methods for this kind of stuff. I can't give you hands-on advice as I'm not familiar with this subject, but I think it's worth the effort to read a few articles about this topic first before you start: just picking words from the text directly won't get you very far I think, you should probably try to find similar words to the ones for that tags already exist. And of course you need to filter out the common words of the language like "the" and stuff. Again, this Python library can help you with this, at least for a few common languages.

Automatically pick tags from context using Python

Tags:

python

tags

jack

2 Answers

ʞɔıu

paprika

Recent Activity

Donate For Us

Automatically pick tags from context using Python

Tags:

python

tags

jack

2 Answers

ʞɔıu

paprika

Related questions

Recent Activity

Donate For Us