Default Encoding and changes

Tags:

By default, Character and String use UTF-16, however, for all practical purposes, in North America and most of the english locales, UTF-8 is sufficient (since it can go upto 4 bytes). So, if I use a InputStreamReader(InputStream), then does it give me default UTF-16 char encoding? Using a InputStreamReader(InputStream, "UTF-8") would provide a UTF-8 encoding, which would suffice my purpose.

How can I auto-set my JVM's default encoding to UTF-8 while using English locale? The intention is to improve performance for Character and String manipulation (by using 8-bit scheme instead of 16-bit encoding and most ASCII is covered using 8-bit encoding and at the same time complying with Unicode standard).

Any comments are appreciated. Thanks!

913

asked Oct 10 '13 14:10

Ashley

1 Answers

The in-memory data types for text in java, char, Character, and String, are UTF-16. Absolutely. Always. Unconditionally.

The only thing you can change is how Java converts from bytes-on-the-outside to chars-on-the-inside. There is no way to change the representation to UTF-8 to trade space for time.

105

answered Sep 22 '22 15:09

bmargulies

Related questions
                            
                                Predeployment of PersistenceUnit failed with EclipseLink
                            
                                Subtraction of 1ms leads to unexpected behaviour
                            
                                Java process hanging on IOUtils. Suspected deadlock
                            
                                Serialize custom exception to JSON, not all fields are serialized
                            
                                Spring+hibernate java.lang.StackOverflowError
                            
                                Job and Task Scheduling In Hadoop
                            
                                java.lang.SecurityException with two conflicting versions of javax.servlet.servlet-api jars
                            
                                Java Enum or other collection
                            
                                Can Class<V> take multiple bounds on the generic type?
                            
                                tomcat-users.xml will not open in UBUNTU
                            
                                Is Java's LinkedList optimized to do get(index) in reverse when necessary?
                            
                                Discarding input from socket
                            
                                Extending class at runtime
                            
                                Memory leak in java ImageIO.read()
                            
                                Does PriorityQueue's remove method rearrange the heap?
                            
                                Running jar as a Linux service - init.d script gets stuck starting app
                            
                                Dividing two integers to a double in java
                            
                                Manipulate Java object reference using class constructor
                            
                                Spring-boot: set default value to configurable properties
                            
                                How to write the clear() method in the list data structure?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Default Encoding and changes

Tags:

java

encoding

unicode

utf-8

Ashley

People also ask

1 Answers

bmargulies

Recent Activity

Donate For Us