Possible Duplicate:
How does the Google “Did you mean?” Algorithm work?
I have a database table with about 2 million records. I'm using MySQL full-text search, but users very often misspell words (e.g. gmes should be games), so I need a library with PHP wrappers that offers a feature like Google's 'Did you mean'. What should I use?
You could use an API like Yahoo's Spelling Suggestion as an easy way of doing this without having to roll your own.
An index is a list of words and the places where they occur, e.g. games occurs in rows 123 and 456. If you have such a list of all words, you can easily search it for the word that best matches the input. This way, you could match gmes with games.
However, this is not possible with the index MySQL provides.
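To illustrate the word-list idea, here is a minimal sketch in Python (not PHP, and not tied to any particular library). It assumes you build your own word index outside MySQL and pick the closest word by Levenshtein edit distance. The names `build_index`, `edit_distance`, `suggest`, the `max_distance` cutoff and the sample rows are all made up for the example.

```python
from collections import defaultdict


def build_index(rows):
    """Map each lowercased word to the set of row ids where it occurs."""
    index = defaultdict(set)
    for row_id, text in rows.items():
        for word in text.lower().split():
            index[word].add(row_id)
    return index


def edit_distance(a, b):
    """Levenshtein distance, computed one row of the table at a time."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def suggest(index, query, max_distance=2):
    """Return the closest indexed word within max_distance, or None."""
    query = query.lower()
    if query in index:
        return query
    # Ties on distance are broken alphabetically so the result is stable.
    best = min(index, key=lambda w: (edit_distance(query, w), w), default=None)
    if best is None or edit_distance(query, best) > max_distance:
        return None
    return best


# Hypothetical sample data: row id -> text column.
rows = {123: "free online games", 456: "board games for kids", 789: "garden gnomes"}
index = build_index(rows)
print(suggest(index, "gmes"))
print(sorted(index["games"]))
```

Note that `suggest` compares the query against every word in the index, which is fine for a small vocabulary but will probably be too slow as-is for the distinct words of 2 million records. In practice you would likely want to narrow the candidates first (for example by word length or first letter) before computing distances.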