Java unicode where to find example N-byte unicode characters

I'm looking for sample 1-byte, 2-byte, 3-byte, 4-byte, 5-byte, and 6-byte Unicode characters. Links to a reference that lists the Unicode characters together with their size in bytes would be greatly appreciated. Ideally, the reference would also give the code points in a form like \uXXXXX.

845

asked May 19 '11 18:05

Mohamed Nuur

1 Answer

There is no such thing as "1-byte, 2-byte, 3-byte, 4-byte, 5-byte, and 6-byte Unicode characters".

You are probably talking about the UTF-8 representations of Unicode characters. UTF-8 encodes each character in 1 to 4 bytes; the 5- and 6-byte forms from the original design were dropped when RFC 3629 restricted UTF-8 to code points up to U+10FFFF. Similarly, Java strings are sequences of UTF-16 code units: the Java char type represents one 16-bit code unit, each Unicode character is represented by either one or two of these code units, and each code unit can be written as \uxxxx in string literals (note that these escapes have exactly 4 hex digits, since code units are 16 bits long).
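To make the distinction concrete, here is a minimal sketch that counts UTF-8 bytes and UTF-16 code units for a few sample characters. The class name Utf8Lengths, its helper methods, and the choice of sample code points are illustrative assumptions, not part of any library.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, for illustration only.
class Utf8Lengths {

    // Number of bytes the code point occupies when encoded as UTF-8.
    static int utf8Length(int codePoint) {
        String s = new String(Character.toChars(codePoint));
        return s.getBytes(StandardCharsets.UTF_8).length;
    }

    // Number of Java chars (UTF-16 code units) needed for the code point.
    static int utf16Length(int codePoint) {
        return Character.charCount(codePoint);
    }

    public static void main(String[] args) {
        // Sample code points: Latin A, e with acute, euro sign, grinning face emoji.
        int[] samples = {0x41, 0xE9, 0x20AC, 0x1F600};
        for (int cp : samples) {
            System.out.printf("U+%04X  UTF-8 bytes: %d  UTF-16 code units: %d%n",
                    cp, utf8Length(cp), utf16Length(cp));
        }
    }
}
```

These four samples need 1, 2, 3, and 4 UTF-8 bytes respectively, and only the last one (outside the Basic Multilingual Plane) needs two Java chars.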

If you need a reference that lists Unicode characters with their UTF-8 and UTF-16 representations, take a look at the table at fileformat.info.
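If you would rather generate such rows yourself for a handful of code points, something along these lines should work. It is only a sketch: the class name UnicodeRow and its methods are hypothetical, and the output layout is not meant to mirror any particular reference site.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, for illustration only.
class UnicodeRow {

    // UTF-8 bytes of the code point as space-separated uppercase hex.
    static String utf8Hex(int codePoint) {
        byte[] bytes = new String(Character.toChars(codePoint))
                .getBytes(StandardCharsets.UTF_8);
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("%02X", b & 0xFF));
        }
        return sb.toString();
    }

    // UTF-16 code units of the code point, each written as a 4-digit Java escape.
    static String javaEscape(int codePoint) {
        StringBuilder sb = new StringBuilder();
        for (char unit : Character.toChars(codePoint)) {
            sb.append(String.format("\\u%04X", (int) unit));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        int[] samples = {0x41, 0xE9, 0x20AC, 0x1F600};
        for (int cp : samples) {
            System.out.printf("U+%04X  UTF-8: %s  Java literal: %s%n",
                    cp, utf8Hex(cp), javaEscape(cp));
        }
    }
}
```

For U+1F600 the Java literal comes out as two escapes (a surrogate pair), which is why a single 5-digit escape cannot be used in a Java string literal.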

See also:

The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)
Unicode - How to get the characters right?
A to Z Index of Unicode Characters

166

answered Sep 19 '22 13:09

axtavt

Tags: java, unicode, codepoint, sample-data
