Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to write unicode cross symbol in Java?

I'm trying to write this unicode cross symbol (𐀵) in Java:

class A {
    public static void main(String[] args) {
        System.out.println("\u2300");
        System.out.println("\u10035");
    }
}

I can write o with a line through it (⌀) just fine, but the cross symbol doesn't show up, instead it just prints the number 5:

# javac A.java && java A
⌀
ဃ5

Why?

like image 332
Dog Avatar asked May 17 '13 19:05

Dog


3 Answers

You're looking for U+10035, which is outside the Basic Multilingual Plane. That means you can't use \u to specify the value, as that only deals with U+0000 to U+FFFF - there are always exactly four hex digits after \u. So currently you've got U+1003 ("MYANMAR LETTER GHA") followed by '5'.

Unfortunately Java doesn't provide a string literal form which makes characters outside the BMP simple to express. The only way of including it in a literal (but still in ASCII) is to use the UTF-16 surrogate pair form:

String cross = "\ud800\udc35";

Alternatively, you could use the 32-bit code point form as an int:

String cross = new String(new int[] { 0x10035 }, 0, 1);

(These two strings are equal.)

Having said all that, your console would still need to support that character - you'll need to try it to find out whether or not it does.

like image 84
Jon Skeet Avatar answered Nov 10 '22 11:11

Jon Skeet


I believe Java represents Unicode characters from 0x0000 to 0xFFFF. Java would evaluate "\u10035" to whatever "\u1003" is and a 5 after that.

like image 44
joshreesjones Avatar answered Nov 10 '22 11:11

joshreesjones


0x10035 is a supplemental Unicode character. You'll need to font that supports it if you want your program to render it.

http://www.oracle.com/technetwork/articles/javase/supplementary-142654.html

like image 1
Tap Avatar answered Nov 10 '22 11:11

Tap