
Unicode range for Japanese

Tags:

unicode

I am trying to separate English and Japanese characters, so I need the Unicode range that covers all Japanese characters. What is the Unicode range of all Japanese characters?

Neeraj Kumar asked Nov 11 '13 05:11




1 Answer

As zawhtut mentioned, this page has a reference for several Unicode ranges. To summarize the ranges (code points in hexadecimal):

  • Japanese-style punctuation (3000 - 303f)
  • Hiragana (3040 - 309f)
  • Katakana (30a0 - 30ff)
  • Full-width roman characters and half-width katakana (ff00 - ffef)
  • CJK unified ideographs, common and uncommon kanji (4e00 - 9faf; the Unicode block itself extends to 9fff)
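The ranges above can be turned into a simple membership test. The following is only a minimal Python sketch, not a definitive implementation: the names `JAPANESE_RANGES`, `is_japanese`, and `split_japanese` are made up for illustration, and it assumes that treating every character in these ranges as "Japanese" is good enough for your input. Note that the ideograph range is shared with Chinese, so this cannot tell kanji apart from Chinese hanzi.

```python
# Inclusive code point ranges, copied from the list above.
JAPANESE_RANGES = [
    (0x3000, 0x303F),  # Japanese-style punctuation
    (0x3040, 0x309F),  # Hiragana
    (0x30A0, 0x30FF),  # Katakana
    (0xFF00, 0xFFEF),  # Full-width roman characters and half-width katakana
    (0x4E00, 0x9FAF),  # CJK unified ideographs (the Unicode block ends at 0x9FFF)
]


def is_japanese(ch):
    """Return True if the single character ch falls in one of the ranges."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in JAPANESE_RANGES)


def split_japanese(text):
    """Split text into (characters in the ranges, all other characters)."""
    japanese = "".join(ch for ch in text if is_japanese(ch))
    other = "".join(ch for ch in text if not is_japanese(ch))
    return japanese, other


japanese, other = split_japanese("Hello こんにちは 世界")
print(japanese)
print(other)
```

Spaces and ASCII punctuation end up in the "other" string here; if they should be dropped or handled separately, filter them out before or after the split.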
Sean Colombo answered Sep 23 '22 12:09