Strange character in the begining xml file

Question

I´m trying to parse one xml but it shows a error, if I put a system.out.println to the String i see it.

before

ï»¿<?xml version="1.0"

after

?<?xml version="1.0"

I´m changing the charset to UTF-8 but didn´t works, so, what should I do?

Josh · Accepted Answer

You have a UTF-8 string (which is why Notepad++ is recognizing it as such), but UTF-8 doesn't require a BOM. Some programs produce it; some don't. This leads to occasional confusion when reading files - some readers (like the one you're using in your Java code) don't recognize and ignore it. I'd recommend something like the accepted answer to this question or this one for removing it. Make sure you implement a check to determine if the first 3 bytes actually are a BOM before removing them from all incoming strings.

Strange character in the begining xml file

Tags:

java

character-encoding

xml

Diego Macario

1 Answers

Josh

Recent Activity

Donate For Us

Strange character in the begining xml file

Tags:

java

character-encoding

xml

Diego Macario

1 Answers

Josh

Related questions

Recent Activity

Donate For Us