Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

What are the unicode ranges for Hindi accented characters?

I'm trying to gather a Unicode list of all the 'o' like shapes in the Hindi character-set. In fact, a list of any characters (in any language) that makes uses of separate characters to indicate an accent would be better.

I intend to use this unicode-list in a RegExp.

I been trying to edit a list of character-ranges by outputting them in an Input TextField, but editing this text causes weird issues (the keyboard-cursor isn't place on the correct character, selections suddenly dissappear / incorrectly warps... in other words... HINDI HELL!)

I've tried this with Notepad++ too, but although it was more responsive, it eventually crapped out on me like it did in the Flash Player textfield. This seems to occur especially while removing the [] block (nulls?) characters. Some of them trigger odd behaviors.

Anyways, all I want is a list of the accents. An example of a few are in the image below (but I would need ALL accents):

enter image description here

Thanks!

like image 638
chamberlainpi Avatar asked Jan 17 '23 08:01

chamberlainpi


1 Answers

You can find pdf's containing lists of unicode ranges, grouped by language, here: http://unicode.org/charts/

For Hindi, you probably want Devanagari or Devanagari Extended.

like image 57
Peter Hall Avatar answered Apr 25 '23 03:04

Peter Hall