Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Error correction in names

I am trying to device an algorithm that performs error correction in names. My approach is having a database with the correct names, compute edit distance between each of them and the name entered and then suggest the 5 or 10 closest.

This task is significantly different from standard error correction in words as some of the names might be replaced by initials. For instance "Jonathan Smith" and "J. Smith" are actually quite close and could easily be considered the same name, so the edit distance should be really small if not 0. Another challenge is that some names might be written differently while sounding the same. For instance Shnaider and Schneider are versions of the same name written by people with different locales(there are better examples for that I guess). And another case - just imagine all the possible errors in writing Jawaharlal Nehru most of which have nothing to do with the real name. Again probably most of them will be similar phonetically.

Obviously Lucene's error correction algorithm will not help me here as it does not handle the above cases.

So my question is: do you know any library capable of doing error correction in names? Can you propose some algorithm for handling the cases mentioned above?

I am interested in libraries in c++ or java. As for algorithm proposals any language or pseudo code will do.

like image 796
Ivaylo Strandjev Avatar asked Nov 06 '12 08:11

Ivaylo Strandjev


People also ask

How do you resolve a name error?

error. Solution: Correct the typo in the syntax and retry the formula. Tip: Instead of manually entering defined names in formulas, you can have Excel do it automatically for you. To do that, go to the Formulas tab, in Defined Names group, click Use in Formula, and then select the defined name you want to add.

How do you correct someone who misspelled their name?

"Simply be honest and inform them of the correct version of your name the first time you hear them say it wrong." Unsurprisingly, directness is your best bet. Gottsman even broke down what a potential correction could look like: Person 1: "It's great to meet you, Julia."

How do you correct a name?

Petition to change your name by filling out a name change form, an order to show cause for legally changing your name, and a decree to legally change your name. Take these forms to the court clerk and file them along with your state's required filing fees.

What is an example of a name error?

One of the most common reasons people see the name error is when they have used the wrong formula name. For example, if you're using the VLOOKUP formula and you type VLOKUP instead, Excel wouldn't know what you mean, and it will show its disapproval by giving you the name error.


1 Answers

For phonetic matching, see Soundex.

I think modifying a Levenshtein distance algorithm to treat "abbreviate to an initial" and "expand from an initial" as single-distance edits ought to be straightforward, but the details are beyond me at the moment.

like image 94
Jamey Sharp Avatar answered Sep 20 '22 18:09

Jamey Sharp