Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How do I escape unicode character 0x1F in xml?

Tags:

xml

unicode

I need to write a text with the unicode character 0x1F in a utf-8 document (it is not an allowed character in xml). Is there a way to escape it, or do I have to discard it?

like image 486
Filip Avatar asked Jul 23 '09 07:07

Filip


2 Answers

This will do it in XML 1.1 (but is discouraged):



It isn't supported at all in XML 1.0. A workaround is to base-64 encode the text containing the character.

like image 187
David M Avatar answered Oct 06 '22 23:10

David M


Using 0x1F XML is illegal (http://www.w3.org/TR/xml11/#charsets). so there is no way to do it, except from encoding it yourself in some way. base-64 (as proposed), or url encode, or any other option that the xml does not understand. It is the same problem as storing binary data in xml.

like image 21
Mihai Nita Avatar answered Oct 06 '22 23:10

Mihai Nita