How can I iterate through the unicode codepoints of a Java String?

1 Answers

Yes, Java uses a UTF-16-esque encoding for internal representations of Strings, and, yes, it encodes characters outside the Basic Multilingual Plane (BMP) using the surrogacy scheme.

If you know you'll be dealing with characters outside the BMP, then here is the canonical way to iterate over the characters of a Java String:

final int length = s.length(); for (int offset = 0; offset < length; ) {    final int codepoint = s.codePointAt(offset);     // do something with the codepoint     offset += Character.charCount(codepoint); }

180

answered Oct 03 '22 17:10

Jonathan Feinberg

Related questions
                            
                                What are the differences between PMD and FindBugs?
                            
                                How can I create an array in Kotlin like in Java by just providing a size?
                            
                                All inclusive Charset to avoid "java.nio.charset.MalformedInputException: Input length = 1"?
                            
                                What are detached, persistent and transient objects in hibernate?
                            
                                Guava equivalent for IOUtils.toString(InputStream)
                            
                                Why JSF saves the state of UI components on server?
                            
                                Is there a method that calculates a factorial in Java? [closed]
                            
                                Is it bad practice to use Reflection in Unit testing? [duplicate]
                            
                                Where does Java's String constant pool live, the heap or the stack?
                            
                                How to change highlighted occurrences color in Eclipse's sidebar?
                            
                                How to make Java honor the DNS Caching Timeout?
                            
                                How can I check if multiplying two numbers in Java will cause an overflow?
                            
                                How to pass JVM options from bootRun
                            
                                XML Document to String
                            
                                Multi-line tooltips in Java?
                            
                                Multiple queries executed in java in single statement
                            
                                Access string.xml Resource File from Java Android Code
                            
                                What is the difference between <jsp:include page = ... > and <%@ include file = ... >? [duplicate]
                            
                                File changed listener in Java
                            
                                Getting all names in an enum as a String[]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I iterate through the unicode codepoints of a Java String?

Tags:

java

string

unicode

rampion

People also ask

1 Answers

Jonathan Feinberg

Recent Activity

Donate For Us