Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How google recognises 2 words without spaces?

I want to understand how google handles no space between 2 words. For example there are 2 words - word1 and word2. I write in search box 'word1word2', it says do you mean 'word1 word2' or just understands to look for 'word1 word2'. Any information what data structure and algorithm they use? I see in this answer How to split text without spaces into list of words?, it is suggested to use trie data structure.

like image 989
John Avatar asked Jul 13 '12 12:07

John


1 Answers

In the candidate generation of the spell corrector, you allow as a possibility omission of a space, just as you allow omission of other letters.... Perhaps look at the spelling correction lecture here: http://nlp-class.org/ [sorry, self-promotion] or Peter Norvig's intro: http://norvig.com/spell-correct.html

like image 141
Christopher Manning Avatar answered Oct 30 '22 02:10

Christopher Manning