I am trying to devise an algorithm that performs error correction in names. My approach is to keep a database of the correct names, compute the edit distance between each of them and the name entered, and then suggest the 5 or 10 closest.
This task is significantly different from standard error correction in words, as some of the names might be replaced by initials. For instance, "Jonathan Smith" and "J. Smith" are actually quite close and could easily be considered the same name, so the edit distance should be really small, if not 0. Another challenge is that some names might be written differently while sounding the same. For instance, Shnaider and Schneider are versions of the same name written by people with different locales (there are better examples for that, I guess). And another case: just imagine all the possible errors in writing Jawaharlal Nehru, most of which have nothing to do with the real name. Again, most of them will probably be similar phonetically.
Obviously Lucene's error correction algorithm will not help me here as it does not handle the above cases.
So my question is: do you know any library capable of doing error correction in names? Can you propose some algorithm for handling the cases mentioned above?
I am interested in libraries in C++ or Java. As for algorithm proposals, any language or pseudocode will do.
For phonetic matching, see Soundex.
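To give an idea of what that looks like, here is a minimal sketch of classic American Soundex in Python (the question allows any language for algorithm proposals). The function name `soundex` and the decision to drop non-ASCII letters are my own assumptions, not something taken from a particular library. Names that get the same four-character code are treated as phonetically similar.

```python
# Letter groups of classic American Soundex; vowels, Y, H and W get no code.
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(name):
    """Return the 4-character Soundex code of a name ("" if it has no letters)."""
    # Assumption: only ASCII letters are considered; everything else is dropped.
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    result = [letters[0]]
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != prev:
            result.append(code)
        prev = code  # a vowel resets prev, so a repeated code counts again
    return ("".join(result) + "000")[:4]


print(soundex("Schneider"), soundex("Shnaider"))
```

With this sketch both spellings from the question map to the same code, so they could be grouped as candidates before any edit distance is computed. Soundex was designed around English pronunciation, so it may work less well for names from other languages.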
I think modifying a Levenshtein distance algorithm to treat "abbreviate to an initial" and "expand from an initial" as single-distance edits ought to be straightforward, but the details are beyond me at the moment.
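One way this could be sketched, under my own assumptions: split each name into tokens, run an edit distance over the tokens, and charge a fixed small cost when one token is an initial of the other. The names `name_distance`, `token_cost`, `is_initial` and the `initial_cost` parameter are hypothetical, and charging a token's length for inserting or dropping it is just one possible choice.

```python
def levenshtein(a, b):
    """Plain character-level Levenshtein distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # substitute
        prev = cur
    return prev[-1]


def is_initial(token):
    """True for a single letter, with or without a trailing dot."""
    return len(token.rstrip(".")) == 1


def token_cost(a, b, initial_cost=1):
    """Cost of replacing token a with token b."""
    if a == b:
        return 0
    # Abbreviating to an initial, or expanding from one, is a single cheap edit.
    if is_initial(a) != is_initial(b) and a[0] == b[0]:
        return initial_cost
    return levenshtein(a.rstrip("."), b.rstrip("."))


def name_distance(x, y, initial_cost=1):
    """Token-level edit distance between two names (case-insensitive)."""
    xs, ys = x.lower().split(), y.lower().split()
    prev = [0]
    for t in ys:
        prev.append(prev[-1] + len(t))  # inserting a token costs its length
    for s in xs:
        cur = [prev[0] + len(s)]        # dropping a token costs its length
        for j, t in enumerate(ys, 1):
            cur.append(min(prev[j] + len(s),
                           cur[j - 1] + len(t),
                           prev[j - 1] + token_cost(s, t, initial_cost)))
        prev = cur
    return prev[-1]


print(name_distance("Jonathan Smith", "J. Smith"))
```

Setting `initial_cost=0` would make "J. Smith" and "Jonathan Smith" identical, which is what the question hints at; keeping it at 1 still ranks an exact match first. An initial with the wrong letter (say "K. Smith") falls back to the ordinary character distance and ends up far away.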