Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Check if a String contains encoded characters

Hello I am looking for a way to detect if a string has being encoded

For example

    String name = "Hellä world";
    String encoded = new String(name.getBytes("utf-8"), "iso8859-1");

The output of this encoded variable is:

Hellä world

As you can see there is an A with grave and another symbol. Is there a way to check if the output contains encoded characters?

like image 472
Decrypter Avatar asked Nov 27 '22 06:11

Decrypter


1 Answers

Sounds like you want to check if a string that was decoded from bytes in latin1 could have been decoded in UTF-8, too. That's easy because illegal byte sequences are replaced by the character \ufffd:

String recoded = new String(encoded.getBytes("iso-8859-1"), "UTF-8");
return recoded.indexOf('\uFFFD') == -1; // No replacement character found
like image 137
Joni Avatar answered Dec 04 '22 21:12

Joni