Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

HTML special character decoding

Using Java on Android I'm struggling to convert a couple of html special characters.

So far I've tried:

String myString = "%A32.00%20per%20month%B3";

Html.fromHtml(myString).toString(); => %A32.00%20per%20month%B3
URLDecoder.decode(myString) => �2.00 per month�
URLDecoder.decode(myString, "UTF-8") => �2.00 per month�
URLDecoder.decode(myString, "ASCII") => �2.00 per month�
org.apache.commons.lang.StringEscapeUtils.unescapeHtml4(myString) => %A32.00%20per%20month%B3

The correct output should be => £2.00 per month³

like image 768
scottyab Avatar asked Dec 27 '22 09:12

scottyab


2 Answers

Your string is encoded in ISO-8859-1, so ASCII and UTF-8 won't work.

String myString = "%A32.00%20per%20month%B3";
URLDecoder.decode(myString, "ISO-8859-1");
// output: £2.00 per month³
like image 68
Floern Avatar answered Dec 31 '22 12:12

Floern


public static void main(String[] args) throws UnsupportedEncodingException {
    String before = "£2.00 per month³";
    String encoded = URLEncoder.encode(before, "UTF-8");
    String decoded = URLDecoder.decode(encoded, "UTF-8");
    System.out.println(encoded);
    System.out.println(decoded);
}

In output I get:

%C2%A32.00+per+month%C2%B3
£2.00 per month³

Are you sure that %A32.00%20per%20month%B3 is correct?

like image 32
pawelzieba Avatar answered Dec 31 '22 10:12

pawelzieba