Algorithm for computing the relevance of a keyword to a short text (50 - 100 words)

Question

I want to compute the relevance of a keyword to a short description text. What would be the best approach in terms of efficiency and ease of implementation. I am using C++?

moinudin · Accepted Answer

Simple solution: Count the occurrences of the word in the text.

To do a good job though is a hard problem that companies like Google have been working on for years. If possible, you might want to take a look at using their technology

To expand, try the following:

Use a dictionary (e.g. WordNet to replace all synonyms with a common word
Detect similar words using Levenshtein distance

That's still only going to get you so far. You'll need to perform some natural language processing to truly understand what the description is about to distinguish between multiple texts containing the keyword the same number of times.

Leniel Maccaferri · Answer

Refer to these previous Stack Overflow questions:

What are Useful Ranking Algorithms for Documents without Links (e.g. PDF, MS Documents, etc…)?
Algorithm for generating a 'top list' using word frequency.

Algorithm for computing the relevance of a keyword to a short text (50 - 100 words)

Tags:

string

algorithm

matching

heuristics

fgungor

2 Answers

moinudin

Leniel Maccaferri

Recent Activity

Donate For Us

Algorithm for computing the relevance of a keyword to a short text (50 - 100 words)

Tags:

string

algorithm

matching

heuristics

fgungor

2 Answers

moinudin

Leniel Maccaferri

Related questions

Recent Activity

Donate For Us