What is the Unicode U+001A Character? Aka 0x1A

1 Answers

U+001A is defined in the Unicode Standard as a control character with the name SUBSTITUTE, and it belongs to a group characterized as follows, in chapter 16 of the standard: “There are 65 code points set aside in the Unicode Standard for compatibility with the C0 and C1 control codes defined in the ISO/IEC 2022 framework [...] The Unicode Standard provides for the intact interchange of these code points, neither adding to nor subtracting from their semantics. The semantics of the control codes are generally determined by the application with which they are used. However, in the absence of specific application uses, they may be interpreted according to the control function semantics specified in ISO/IEC 6429:1992.”

ISO 6429 is effectively equivalent to ECMA 48, which mentions this code as having the short name SUB, too, and defines it as follows: “SUB is used in the place of a character that has been found to be invalid or in error. SUB is intended to be introduced by automatic means.” This reflects the definition of this control code in Ascii.

Thus, in general, U+001A may be used to indicate a character-level data error, such as the presence of bytes, in purported character data, that have no interpretation in the character encoding being applied. Loosely speaking, it would thus mean “bad character data”, but more appropriately “malformed data, when trying to interpret data as characters”. However, in Unicode, U+FFFD REPLACEMENT CHARACTER is more appropriate, as it has specific Unicode semantics.

Since the question has been tagged with “xml”, it needs to be noted that in XML 1.0, U+001A is forbidden, by clause 2.2 Characters. Note that the comment “any Unicode character, excluding the surrogate blocks, FFFE, and FFFF” is misleading (but comments are non-normative); U+001A is a Unicode character, though it is not a graphic character and its effect is not defined in the Unicode Standard.

147

answered Oct 06 '22 10:10

Jukka K. Korpela

Related questions
                            
                                How to convert an XmlDocument to an array<byte>?
                            
                                Android Databinding xml duplicate attribute
                            
                                Can anybody recommend a free xslt tool? [closed]
                            
                                Override layout xml from android framework
                            
                                JSON Schema compared with XML Schema and their future
                            
                                What browsers support XSLT 2.0?
                            
                                What does this mean "xmlns:xliff"? XML
                            
                                XSL if: test with multiple test conditions
                            
                                How can I set the color of android rating bar's stroke? (Not the color of the stars but the BORDER)
                            
                                How do I validate xml against a DTD file in Python
                            
                                JAXB required=true doesn't seem to require
                            
                                What is the difference between xsd and xsi?
                            
                                XSL - rounding/format-number problem
                            
                                XML Namespace URI with HTTPS?
                            
                                invalid byte 2 of 2-byte UTF-8 sequence
                            
                                what actually is PCDATA and CDATA?
                            
                                Getting "ï»¿" at the beginning of my XML File after save() [duplicate]
                            
                                Where do I put my XML beans in a Spring Boot application?
                            
                                How to convert XElement to XDocument
                            
                                How can I select an element with multiple classes with Xpath?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is the Unicode U+001A Character? Aka 0x1A

Tags:

xml

unicode

utf-8

utf-16

KevSheedy

People also ask

1 Answers

Jukka K. Korpela

Recent Activity

Donate For Us