I saw a comment here that all solutions with <code>charAt</code> are wrong. I could not exactly understand and find something about <code>charAt</code> on internet. As I look the source code it just returns an element from the char array. So my question is that if there any problem or issue about using <code>charAt</code>? Comment is like that <blockquote> Strictly speaking, all the solutions based on charAt are wrong, as charAt doesn't give you the "character at", but the "code unit at", and there are code units that are not characters and characters that need multiple code units. </blockquote>

Different characters are encoded with a different numbers of bytes (using UTF-16 scheme). For example, the "A" character is represented as follows: <pre class="prettyprint"><code>01000001 </code></pre> So far so good. But if you have a character like 𝔴, you'll have a problem. Its UTF-16 representation (BE) is: <pre class="prettyprint"><code>11011000 00110101 11011101 00110100 </code></pre> And then <code>charAt</code> can indeed return the second code unit for that character. See the JDK 7 implementation of <code>String#charAt</code>: <pre class="prettyprint"><code>public char charAt(int index) { if ((index < 0) || (index >= count)) { throw new StringIndexOutOfBoundsException(index); } return value[index + offset]; } </code></pre>

Possible problems with String reversing using charAt method

Tags:

java

string

charat

I saw a comment here that all solutions with charAt are wrong. I could not exactly understand and find something about charAt on internet. As I look the source code it just returns an element from the char array. So my question is that if there any problem or issue about using charAt?

Comment is like that

Strictly speaking, all the solutions based on charAt are wrong, as charAt doesn't give you the "character at", but the "code unit at", and there are code units that are not characters and characters that need multiple code units.

603

asked Mar 14 '16 13:03

user1474111

1 Answers

Different characters are encoded with a different numbers of bytes (using UTF-16 scheme). For example, the "A" character is represented as follows:

01000001

So far so good.

But if you have a character like 𝔴, you'll have a problem. Its UTF-16 representation (BE) is:

11011000 00110101 11011101 00110100

And then charAt can indeed return the second code unit for that character.

See the JDK 7 implementation of String#charAt:

public char charAt(int index) {
    if ((index < 0) || (index >= count)) {
        throw new StringIndexOutOfBoundsException(index);
    }
    return value[index + offset];
}

154

answered Oct 18 '22 18:10

Maroun

Related questions
                            
                                how can I get the text before and after the "-" (dash)
                            
                                How can I disable System.out for speed in Java
                            
                                Open URL in Java to get the content
                            
                                KeyStore with BouncyCastleProvider: KeyStore integrity check failed
                            
                                Infinite Scrolling Image ViewPager
                            
                                Any difference between String = null and String.isEmpty?
                            
                                Java Ternary Operator to set True or false
                            
                                What's the Java regular expression for an only integer numbers string?
                            
                                Compare two lists for updates, deletions and additions
                            
                                JUnit Exception Testing
                            
                                Java SOAP "wsimport" - force wrapped binding from document/literal wrapped WSDL?
                            
                                How to convert ArrayList to String[] in java, Arraylist contains VO objects
                            
                                Convert char* to jstring in JNI, when char* is passed using va_arg
                            
                                How do I run java program with multiple classes from cmd?
                            
                                Java MongoDB FindOne to get last inserted record
                            
                                Error when creating a new maven project
                            
                                Difference between throw and throws in Java? [duplicate]
                            
                                Eclipse error "Could not find or load main class"
                            
                                How do I implement JDatePicker
                            
                                Using if else in For Loop increment

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With