Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Search engine with 'did you mean' [duplicate]

Possible Duplicate:
How does the Google “Did you mean?” Algorithm work?

I have a database table with about 2 million records. I'm using mysql fulltext to searching but users very often enter bad words ex gmes -> should be games so I need a library with PHP wrappers to have feature like in google 'did you mean' What should I use?

like image 388
ekapek Avatar asked Aug 05 '10 14:08

ekapek


People also ask

How do the search engines identify duplicate content?

Duplicate content is content which is available on multiple URLs on the web. Because more than one URL shows the same content, search engines don't know which URL to list higher in the search results. Therefore they might rank both URLs lower and give preference to other webpages.

How does Google identify duplicated content?

Google uses a predictive method to detect duplicate content based on URL patterns, which could lead to pages being incorrectly identified as duplicates. In order to prevent unnecessary crawling and indexing, Google tries to predict when pages may contain similar or duplicate content based on their URLs.

What are duplicate websites?

Duplicate content is content that appears on the Internet in more than one place. That “one place” is defined as a location with a unique website address (URL) - so, if the same content appears at more than one web address, you've got duplicate content.

How do I stop Google duplicate content?

301 redirects are the easiest way to fix duplicate content issues on your site. So if you found a bunch of duplicate content pages on your site, redirect them back to the original. Once Googlebot stops by, it will process the redirect and ONLY index the original content.


2 Answers

You could use an API like Yahoo's Spelling Suggestion as any easy way of completing it with out having to roll your own.

like image 196
Treffynnon Avatar answered Oct 17 '22 06:10

Treffynnon


An index is a list of words, and the place they occur. E.g. games occurs on row 123 and 456. If you have such a list with all words, you can easily search for the word that matches best. This way, you could match gmes with games.

However, this is not possible with the index MySQL provides.

like image 1
Sjoerd Avatar answered Oct 17 '22 07:10

Sjoerd