Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Multi-language search matching

Suppose we have the name written in any none-latin letters - languages, like Arabic, Hebrew, Chinese, Japanese etc.

How could a search engine match between the original name and the English spelling of the same name. and vice versa?

Something like the name 拓海 in Japanese and the English spelling Takumi.

what is the algorithm/technique used to do this ?

like image 267
EzzatA Avatar asked Jun 07 '26 04:06

EzzatA


1 Answers

good day.

you have to do following:

classificate each lang in the world on the same symbols:

all langs:

  • Engish [26 letters] a b c d e f g ...
  • Russian [33 letters] a б в г д е ....
  • Chinese [x letters] ....
  • Ukrainian [x letters] a б в г д ..... i
  • Japanese [x letters] ...
  • .................

finally you will be have rules between any symbols spelling in any langs. Some langs, for instance, Hindi, Chinese and etc not will be have any rules. you should be create your own rules(based on transcription of this langs).

algo:

[w][e][п] = wep

e e r

e - eng r - rus transcription[п] = p

like image 101
Dmitry Zagorulkin Avatar answered Jun 08 '26 17:06

Dmitry Zagorulkin



Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!