Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to make an International Soundex?

Tags:

soundex

E.g. the Soundex algorithm is optimized for English. Is there a more universal algorithm that would apply across large families of languages?

like image 210
torial Avatar asked Sep 24 '08 14:09

torial


1 Answers

SOUNDEX is indeed English-oriented. Two others that take a wider variety of phonetic differences into account are: Double Metaphone and NYSIIS.

They produce encodings into a much larger possible space than SOUNDEX does. Double Metaphone, specifically, includes reductions with the express purpose of handling alternate pronunciations based on more languages than English.

I did a presentation on fuzzy string matching recently, the slides may be helpful.

like image 123
Kyle Burton Avatar answered Dec 31 '22 23:12

Kyle Burton