What exactly does String.codePointAt do?

Recently I ran into the codePointAt method of String in Java. I also found a few other codePoint methods: codePointBefore, codePointCount, etc. They definitely have something to do with Unicode, but I do not understand them.

Now I wonder when and how one should use codePointAt and similar methods.

795

asked Sep 05 '12 11:09

Michael

1 Answer

Short answer: it gives you the Unicode codepoint that starts at the specified index in the String, i.e. the "Unicode number" of the character at that position.
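As a rough illustration of the short answer, here is a minimal sketch. The class name CodePointAtDemo and the sample string are made up for this example; codePointAt, codePointBefore, charAt and length are the standard String methods. The emoji is written as Unicode escapes so the file encoding does not matter.

```java
public class CodePointAtDemo {
    // Hypothetical sample text: "a", then GRINNING FACE (U+1F600) written as
    // its two UTF-16 code units, then "b".
    static final String TEXT = "a\uD83D\uDE00b";

    public static void main(String[] args) {
        // length() counts char values (UTF-16 code units), not characters.
        System.out.println(TEXT.length());                                 // 4

        // Index 0 holds a plain BMP character.
        System.out.println(Integer.toHexString(TEXT.codePointAt(0)));      // 61

        // Index 1 is the start of a surrogate pair: codePointAt combines both
        // char values, while charAt returns only the first half.
        System.out.println(Integer.toHexString(TEXT.codePointAt(1)));      // 1f600
        System.out.println(Integer.toHexString(TEXT.charAt(1)));           // d83d

        // Index 2 points into the middle of the pair, so only the lone
        // low surrogate comes back.
        System.out.println(Integer.toHexString(TEXT.codePointAt(2)));      // de00

        // codePointBefore looks backwards from the given index.
        System.out.println(Integer.toHexString(TEXT.codePointBefore(3)));  // 1f600
    }
}
```

Note that the index passed to codePointAt is still a char index, not a "character number", which is why index 2 lands inside the pair.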

Longer answer: Java was created when 16 bits (i.e. one char) were enough to hold any Unicode character that existed (that range is now known as the Basic Multilingual Plane, or BMP). Later, Unicode was extended to include characters with codepoints above U+FFFF, i.e. 2¹⁶ and beyond. This means that a char can no longer hold every possible Unicode codepoint.

UTF-16 was the solution: it stores the "old" Unicode codepoints in 16 bits (i.e. exactly one char) and all the new ones in 32 bits (i.e. two char values). Those two 16-bit values are called a "surrogate pair". Strictly speaking, a char now holds a "UTF-16 code unit" rather than "a Unicode character" as it used to.
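To make the surrogate pair idea concrete, the following sketch splits a codepoint into its char values in two ways: with the standard Character.toChars, and by hand using the usual UTF-16 arithmetic. The class and method names (SurrogatePairDemo, encode, encodeByHand) are invented for this example, and encodeByHand assumes it is given a valid codepoint.

```java
import java.util.Arrays;

public class SurrogatePairDemo {
    // Library route: returns one char for BMP codepoints, two for the rest.
    static char[] encode(int codePoint) {
        return Character.toChars(codePoint);
    }

    // Hand-written equivalent, assuming a valid codepoint is passed in.
    static char[] encodeByHand(int codePoint) {
        if (!Character.isSupplementaryCodePoint(codePoint)) {
            return new char[] {(char) codePoint};
        }
        int offset = codePoint - 0x10000;               // leaves a 20-bit value
        char high = (char) (0xD800 + (offset >> 10));   // top 10 bits
        char low = (char) (0xDC00 + (offset & 0x3FF));  // bottom 10 bits
        return new char[] {high, low};
    }

    public static void main(String[] args) {
        char[] pair = encode(0x1F600);
        System.out.println(pair.length);                             // 2
        System.out.println(Integer.toHexString(pair[0]));            // d83d
        System.out.println(Integer.toHexString(pair[1]));            // de00
        System.out.println(Character.isHighSurrogate(pair[0]));      // true
        System.out.println(Character.isLowSurrogate(pair[1]));       // true
        System.out.println(Arrays.equals(pair, encodeByHand(0x1F600))); // true

        // Going back from the pair to the codepoint.
        int codePoint = Character.toCodePoint(pair[0], pair[1]);
        System.out.println(Integer.toHexString(codePoint));          // 1f600

        // A BMP character still fits in a single char.
        System.out.println(Character.charCount('A'));                // 1
    }
}
```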

All the "old" methods (which handle only char) still work fine as long as you don't use any of the "new" Unicode characters (or don't really care about them). If you do care about the new characters (or simply need complete Unicode support), you need to use the "codepoint" versions, which support all possible Unicode codepoints.
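One place where the difference shows up is counting and iterating. The sketch below walks a String codepoint by codepoint instead of char by char; CodePointIterationDemo, countCodePoints and codePointsOf are hypothetical names chosen for this example, while codePointCount, codePointAt, Character.charCount and codePoints() (Java 8 and later) are standard.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class CodePointIterationDemo {
    // Number of codepoints, as opposed to s.length(), which counts char values.
    static int countCodePoints(String s) {
        return s.codePointCount(0, s.length());
    }

    // Collect every codepoint, stepping over both halves of a surrogate pair.
    static List<Integer> codePointsOf(String s) {
        List<Integer> result = new ArrayList<>();
        for (int i = 0; i < s.length(); ) {
            int codePoint = s.codePointAt(i);
            result.add(codePoint);
            i += Character.charCount(codePoint); // advance by 1 or 2 chars
        }
        return result;
    }

    public static void main(String[] args) {
        String text = "a\uD83D\uDE00b"; // "a", U+1F600, "b"

        System.out.println(text.length());         // 4
        System.out.println(countCodePoints(text)); // 3
        System.out.println(codePointsOf(text));    // [97, 128512, 98]

        // The stream-based equivalent yields the same values.
        System.out.println(Arrays.toString(text.codePoints().toArray())); // [97, 128512, 98]
    }
}
```

For text that is known to stay inside the BMP, a plain charAt loop gives the same result; the codepoint loop only matters once supplementary characters can appear.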

Note: A very well-known example of Unicode characters that are not in the BMP (i.e. that work only with the codepoint variants) is emoji: even the simple Grinning Face 😀 (U+1F600) can't be represented in a single char.

answered Oct 02 '22 00:10

Joachim Sauer

Tags:

java

string

unicode

codepoint
