What is the length of a string encoded in a ByteBuffer?

Tags:

java

character-encoding

byte[] byteArray = Charset.forName("UTF-8").encode("hello world").array();
System.out.println(byteArray.length);

Why does the above code print 12? Shouldn't it print 11 instead?

asked Sep 19 '14 19:09

Umesh

2 Answers

The length of the array is the ByteBuffer's capacity, which is derived from, but not equal to, the number of characters you are encoding. Let's take a look at how memory is allocated for the ByteBuffer...

If you drill into the encode() method, you'll find that CharsetEncoder#encode(CharBuffer) looks like this:

public final ByteBuffer encode(CharBuffer in)
    throws CharacterCodingException
{
    int n = (int)(in.remaining() * averageBytesPerChar());
    ByteBuffer out = ByteBuffer.allocate(n);
    ...

According to my debugger, the averageBytesPerChar of a UTF_8$Encoder is 1.1, and the input String has 11 characters. 11 * 1.1 = 12.1, and the cast to int truncates that to 12, so the ByteBuffer is allocated with a capacity of 12. Only 11 of those bytes are actually written; the backing array returned by array() still has all 12.
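The arithmetic above can be reproduced with a small sketch. The class and method names here (CapacityEstimateDemo, estimatedCapacity, encodeUtf8) are made up for illustration, and the 1.1 figure is what one JDK's UTF-8 encoder reported in a debugger, so the sketch prints the value rather than assuming it. The actual capacity may differ between JDK versions.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharsetEncoder;
import java.nio.charset.StandardCharsets;

class CapacityEstimateDemo {
    // Mirrors the sizing expression quoted above:
    // (int)(in.remaining() * averageBytesPerChar())
    static int estimatedCapacity(String s, CharsetEncoder encoder) {
        return (int) (s.length() * encoder.averageBytesPerChar());
    }

    // Same call chain as in the question, minus array().
    static ByteBuffer encodeUtf8(String s) {
        return StandardCharsets.UTF_8.encode(s);
    }

    public static void main(String[] args) {
        String s = "hello world";
        CharsetEncoder encoder = StandardCharsets.UTF_8.newEncoder();

        // Implementation-specific value; reported as 1.1 in the answer above.
        System.out.println("averageBytesPerChar = " + encoder.averageBytesPerChar());
        System.out.println("estimated capacity  = " + estimatedCapacity(s, encoder));

        ByteBuffer out = encodeUtf8(s);
        // capacity() is what was allocated; remaining() is what was written.
        System.out.println("actual capacity     = " + out.capacity());
        System.out.println("bytes written       = " + out.remaining());
    }
}
```

The last line is the number the question was after: the bytes between the buffer's position and its limit.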

answered Nov 15 '22 19:11

azurefrog

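Because encode() returns a ByteBuffer, and array() hands back the buffer's entire backing array. The array's length reflects the buffer's capacity (and not even exactly that, because a sliced buffer can share a larger array), not how many bytes are used. It's a bit like how malloc(10) is free to return 32 bytes of memory.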

System.out.println(Charset.forName("UTF-8").encode("hello world").limit());

That's 11 (as expected).
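If what you need is a byte[] of exactly the encoded length, one option is to copy only the bytes between the buffer's position and limit. This is a minimal sketch; EncodedBytesDemo and toExactBytes are illustrative names, not part of any library. For a plain String, String.getBytes(StandardCharsets.UTF_8) gives the same bytes without going through a ByteBuffer at all.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

class EncodedBytesDemo {
    // Copies only the bytes between position and limit into a new array.
    static byte[] toExactBytes(ByteBuffer buffer) {
        byte[] bytes = new byte[buffer.remaining()];
        // duplicate() shares the content but has its own position,
        // so the caller's buffer is left unchanged.
        buffer.duplicate().get(bytes);
        return bytes;
    }

    public static void main(String[] args) {
        ByteBuffer buffer = StandardCharsets.UTF_8.encode("hello world");

        byte[] exact = toExactBytes(buffer);
        byte[] direct = "hello world".getBytes(StandardCharsets.UTF_8);

        System.out.println(exact.length);                 // 11
        System.out.println(Arrays.equals(exact, direct)); // true
    }
}
```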

answered Nov 15 '22 19:11

David Ehrmann
