I'm writing a Hadoop/HBase job. I needed to transform a Java String
into a byte array. Is there any differences between Java's String.getBytes()
and Hadoop's Bytes.toBytes()
?
According to its documentation Bytes.toBytes()
converts the parameter to a byte[]
using UTF-8.
String.getBytes()
(without arguments) will convert the String
to byte[]
using the platform default encoding. That encoding can vary depending on the OS and user settings. Use of that method should generally be avoided.
You could use String.getBytes(String)
(or the Charset
variant) to specify the encoding to be used.
Reading the Javadoc, it appear that String.getBytes() returns a byte[]
using the default encoding and Bytes.toBytes() returns a byte[]
using UTF-8
This could be the same thing, but it might not be.
Its always useful to read the Javadoc if you want to know something. ;)
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With