I have the following code <pre class="prettyprint"><code>public class MainDefault { public static void main (String[] args) { System.out.println("²³"); System.out.println(Arrays.toString("²³".getBytes())); } } </code></pre> But can't seem to print the special characters to the console When I do the following, I get the following result <pre class="prettyprint"><code>$ javac MainDefault.java $ java MainDefault </code></pre> <img src="https://i.stack.imgur.com/YqKEd.png" alt="MainDefaultPrinting"> On the other hand, when I compile it and run it like this <pre class="prettyprint"><code>$ javac -encoding UTF8 MainDefault.java $ java MainDefault </code></pre> <img src="https://i.stack.imgur.com/wrvzm.png" alt="MainDefaultUTF8CompilationOnly"> And when I run it using the file encoding UTF8 flag, I get the following <pre class="prettyprint"><code>$ java -Dfile.encoding=UTF8 MainDefault </code></pre> <img src="https://i.stack.imgur.com/PSafY.png" alt="MainDefaultUTF8CompilationAndRun"> It's doesn't seem to be a problem with the console (Git Bash on Windows 10), as it prints the characters normally <img src="https://i.stack.imgur.com/URPsy.png" alt="Echo"> Thanks for your help

Your code are not printing the right characters in the console because your Java program and the console are using different character sets, different encodings. If you want to obtain the same characters, you first need to determine which character sets are in place. This process will depend on the "console" in which you are outputting your results. If you are working with Windows and <code>cmd</code>, as @RickJames suggested, you can use the <code>chcp</code> command to determine the active code page. Oracle provides the Java full supported encodings information, and the correspondence with other alias - code pages in this case - in this page. This stackoverflow answer also provides some guidance about the mapping between Windows Code Pages and Java charsets. As you can see in the provided links, the code page for <code>UTF-8</code> is <code>65001</code>. If you are using Git Bash (MinTTY), you can follow @kriegaex instructions to verify or configure <code>UTF-8</code> as the terminal emulator encoding. Linux and UNIX, or UNIX derived systems like Mac OS, do not use code page identifiers, but locales. The locale information can vary between systems, but you can either use the <code>locale</code> command or try to inspect the <code>LC_*</code> system variables to find the required information. This is the output of the <code>locale</code> command in my system: <pre class="prettyprint lang-sh prettyprint-override"><code>LANG="es_ES.UTF-8" LC_COLLATE="es_ES.UTF-8" LC_CTYPE="es_ES.UTF-8" LC_MESSAGES="es_ES.UTF-8" LC_MONETARY="es_ES.UTF-8" LC_NUMERIC="es_ES.UTF-8" LC_TIME="es_ES.UTF-8" LC_ALL= </code></pre> Once you know this information, you need to run your Java program with the <code>file.encoding</code> VM option corresponding to the right charset: <pre class="prettyprint lang-sh prettyprint-override"><code>java -Dfile.encoding=UTF8 MainDefault </code></pre> Some classes, like <code>PrintStream</code> or <code>PrintWriter</code>, allows you to indicate the <code>Charset</code> in which the information will be outputted. The <code>-encoding</code> <code>javac</code> option only allows you to specify the character encoding used by source files. If you are using Windows with Git Bash, consider also reading this @rmunge answer: it provides information about a possible bug in the tool that may be the reason for the problem and that prevents the terminal from running correctly out of the box without the need for manual encoding adjustments.

I am also using the Git Bash on Windows 10 and It works totally fine for me. Here's how it prints, <img src="https://i.stack.imgur.com/ZQ0lg.png" alt="Trying to reproduce it in Git Bash on Windows 10"> Terminal version is <code>mintty 3.0.2 (x86_64-pc-msys)</code> and My text properties were, <img src="https://i.stack.imgur.com/fWAWz.png" alt="enter image description here"> So, I tried to reproduce your outputs by changing Character Sets; <img src="https://i.stack.imgur.com/BPptW.png" alt="enter image description here"> By setting Character Set to <code>CP437 (OEM codepage)</code> (Note that this automatically changed Locale to <code>C</code> too), I could be able to get the output as you got. <img src="https://i.stack.imgur.com/sJVco.png" alt="enter image description here"> And then after when I change it back to <code>UTF-8 (Unicode)</code>, the I could get the output as expected! <img src="https://i.stack.imgur.com/RfIy8.png" alt="enter image description here"> Therefore, it is clear that the problem is with your console's Character Set.

UTF-8 does not print characters to the console

Tags:

java

character-encoding

compilation

encoding

utf-8

I have the following code

public class MainDefault {
        public static void main (String[] args) {
                System.out.println("²³");
                System.out.println(Arrays.toString("²³".getBytes()));
        }
}

But can't seem to print the special characters to the console

When I do the following, I get the following result

$ javac MainDefault.java
$ java MainDefault

MainDefaultPrinting

On the other hand, when I compile it and run it like this

$ javac -encoding UTF8 MainDefault.java
$ java MainDefault

MainDefaultUTF8CompilationOnly

And when I run it using the file encoding UTF8 flag, I get the following

$ java -Dfile.encoding=UTF8 MainDefault

MainDefaultUTF8CompilationAndRun

It's doesn't seem to be a problem with the console (Git Bash on Windows 10), as it prints the characters normally

Echo

Thanks for your help

622

asked Sep 02 '20 19:09

Yassin Hajaj

2 Answers

Your code are not printing the right characters in the console because your Java program and the console are using different character sets, different encodings.

If you want to obtain the same characters, you first need to determine which character sets are in place.

This process will depend on the "console" in which you are outputting your results.

If you are working with Windows and cmd, as @RickJames suggested, you can use the chcp command to determine the active code page.

Oracle provides the Java full supported encodings information, and the correspondence with other alias - code pages in this case - in this page.

This stackoverflow answer also provides some guidance about the mapping between Windows Code Pages and Java charsets.

As you can see in the provided links, the code page for UTF-8 is 65001.

If you are using Git Bash (MinTTY), you can follow @kriegaex instructions to verify or configure UTF-8 as the terminal emulator encoding.

Linux and UNIX, or UNIX derived systems like Mac OS, do not use code page identifiers, but locales. The locale information can vary between systems, but you can either use the locale command or try to inspect the LC_* system variables to find the required information.

This is the output of the locale command in my system:

LANG="es_ES.UTF-8"
LC_COLLATE="es_ES.UTF-8"
LC_CTYPE="es_ES.UTF-8"
LC_MESSAGES="es_ES.UTF-8"
LC_MONETARY="es_ES.UTF-8"
LC_NUMERIC="es_ES.UTF-8"
LC_TIME="es_ES.UTF-8"
LC_ALL=

Once you know this information, you need to run your Java program with the file.encoding VM option corresponding to the right charset:

java -Dfile.encoding=UTF8 MainDefault

Some classes, like PrintStream or PrintWriter, allows you to indicate the Charset in which the information will be outputted.

The -encoding javac option only allows you to specify the character encoding used by source files.

If you are using Windows with Git Bash, consider also reading this @rmunge answer: it provides information about a possible bug in the tool that may be the reason for the problem and that prevents the terminal from running correctly out of the box without the need for manual encoding adjustments.

167

answered Oct 09 '22 17:10

jccampanero

I am also using the Git Bash on Windows 10 and It works totally fine for me.

Here's how it prints,

Trying to reproduce it in Git Bash on Windows 10

Terminal version is mintty 3.0.2 (x86_64-pc-msys) and My text properties were,

enter image description here

So, I tried to reproduce your outputs by changing Character Sets;

enter image description here

By setting Character Set to CP437 (OEM codepage) (Note that this automatically changed Locale to C too), I could be able to get the output as you got.

enter image description here

And then after when I change it back to UTF-8 (Unicode), the I could get the output as expected!

enter image description here

Therefore, it is clear that the problem is with your console's Character Set.

answered Oct 09 '22 15:10

Tharindu Sathischandra

Related questions
                            
                                Java Stream API storing lambda expression as variable
                            
                                Java - How to Solve this 2D Array Hour Glass?
                            
                                Does completableFuture in Java 8 scale to multiple cores?
                            
                                How to disable a node without greying it out in JavaFX?
                            
                                Java expressions [duplicate]
                            
                                Add a simple row to JavaFx tableView
                            
                                Spring Data CrudRepository @Query With LIKE and IgnoreCase
                            
                                Java - 'Finally' equivalent for if statement
                            
                                how to remove a query parameter from a query string
                            
                                Append object to list and return result in Java 8?
                            
                                Why is this Java static field null?
                            
                                PolyUtil.containsLocation doesn't work as expected
                            
                                Spring WebFlux: Emit exception upon null value in Spring Data MongoDB reactive repositories?
                            
                                JAXB - How to marshal java object without header
                            
                                What is difference between of listofIntegers.add(ValueOf(50)); and listofIntegers.add(50); in Java
                            
                                Happens-before rules in Java Memory Model
                            
                                What is the purpose to use direct memory in Java?
                            
                                How to upload files with graphql-java?
                            
                                Java Excel/POJO Mapping in POI
                            
                                Shorter way to check for not null for multiple variables

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

UTF-8 does not print characters to the console

Tags:

java

character-encoding

compilation

encoding

utf-8

Yassin Hajaj

People also ask

2 Answers

jccampanero

Tharindu Sathischandra

Recent Activity

Donate For Us