Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Print unicode character in java

Displaying unicode character in java shows "?" sign. For example, i tried to print "अ". Its unicode Number is U+0905 and html representation is "अ". The below codes prints "?" instead of unicode character.

char aa = '\u0905';
String myString = aa + " result" ;
System.out.println(myString); // displays "? result"

Is there a way to display unicode character directly from unicode itself without using unicode numbers? i.e "अ" is saved in file now display the file in jsp.

like image 482
Redone Avatar asked Jul 03 '17 05:07

Redone


3 Answers

Java defines two types of streams, byte and character.

The main reason why System.out.println() can't show Unicode characters is that System.out.println() is a byte stream that deal with only the low-order eight bits of character which is 16-bits.

In order to deal with Unicode characters(16-bit Unicode character), you have to use character based stream i.e. PrintWriter.

PrintWriter supports the print( ) and println( ) methods. Thus, you can use these methods in the same way as you used them with System.out.

PrintWriter printWriter = new PrintWriter(System.out,true);
char aa = '\u0905';
printWriter.println("aa = " + aa);
like image 184
Mohammed Tamimi Avatar answered Sep 24 '22 19:09

Mohammed Tamimi


try to use utf8 character set -

        Charset utf8 = Charset.forName("UTF-8");
        Charset def = Charset.defaultCharset();

        String charToPrint = "u0905";

        byte[] bytes = charToPrint.getBytes("UTF-8");
        String message = new String(bytes , def.name());

        PrintStream printStream = new PrintStream(System.out, true, utf8.name());
        printStream.println(message); // should print your character
like image 34
Razib Avatar answered Sep 20 '22 19:09

Razib


Your myString variable contains the perfectly correct value. The problem must be the output from System.out.println(myString) which has to send some bytes to some output to show the glyphs that you want to see.

System.out is a PrintStream using the "platform default encoding" to convert characters to byte sequences - maybe your platform doesn't support that character. E.g. on my Windows 7 computer in Germany, the default encoding is CP1252, and there's no byte sequence in this encoding that corresponds to your character.

Or maybe the encoding is correct, but simply the font that creates graphical glyphs from characters doesn't have that charater.

If you are sending your output to a Windows CMD.EXE window, then maybe both reasons apply.

But be assured, your string is correct, and if you send it to a destination that can handle it (e.g. a Swing JTextField), it'll show up correctly.

like image 39
Ralf Kleberhoff Avatar answered Sep 20 '22 19:09

Ralf Kleberhoff