JSON Unicode escape sequence - lowercase or not?

I was reading RFC 4627 and I can't figure out if the following is valid JSON or not. Consider this minimalistic JSON text:

["\u005c"]

The problem is the lowercase c.

According to the text of the RFC it is allowed:

Any character may be escaped. If the character is in the Basic Multilingual Plane (U+0000 through U+FFFF), then it may be represented as a six-character sequence: a reverse solidus, followed by the lowercase letter u, followed by four hexadecimal digits that encode the character's code point. The hexadecimal letters A though F can be upper or lowercase. So, for example, a string containing only a single reverse solidus character may be represented as "\u005C".

(Emphasis mine)

The problem is that the RFC also contains the grammar for this:

char = unescaped /
       escape (
           %x22 /          ; "    quotation mark  U+0022
           %x5C /          ; \    reverse solidus U+005C
           %x2F /          ; /    solidus         U+002F
           %x62 /          ; b    backspace       U+0008
           %x66 /          ; f    form feed       U+000C
           %x6E /          ; n    line feed       U+000A
           %x72 /          ; r    carriage return U+000D
           %x74 /          ; t    tab             U+0009
           %x75 4HEXDIG )  ; uXXXX                U+XXXX

where HEXDIG is defined in referenced RFC 4234 as

HEXDIG         =  DIGIT / "A" / "B" / "C" / "D" / "E" / "F"

which includes only uppercase letters.

FWIW, from what I've researched, most JSON parsers accept both uppercase and lowercase letters.
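As one quick data point (not a survey of parsers), here is a minimal check against Python's standard `json` module, which decodes both spellings to the same single reverse solidus character. The variable names are only for illustration.

```python
import json

# The JSON text ["\u005c"] with a lowercase hex letter...
lower = json.loads('["\\u005c"]')
# ...and the same text with an uppercase hex letter.
upper = json.loads('["\\u005C"]')

# Both decode to a list holding one backslash character.
print(lower == upper == ["\\"])  # prints True
```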

Question(s): What is actually correct? Is there a contradiction, and should the grammar in the RFC be fixed?

496

asked Jun 13 '14 22:06

Daniel Frey

1 Answer

I think it's explained by this part of RFC 4234:

ABNF strings are case-insensitive and the character set for these strings is us-ascii.

Hence:
    rulename = "abc"
and:
    rulename = "aBc"
will match "abc", "Abc", "aBc", "abC", "ABc", "aBC", "AbC", and "ABC".

On the other hand, the follow-on part is not terribly clear:

To specify a rule that IS case SENSITIVE, specify the characters individually.

For example:
    rulename    =  %d97 %d98 %d99
or
    rulename    =  %d97.98.99

In the case of the HEXDIG rule, they're individual characters to start with - but they're specified literally as "A" etc rather than %x41, so I suspect that means they're case-insensitive. It's not the clearest spec I've read :(
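To make that reading concrete, here is a rough sketch of a matcher that treats the quoted literals in HEXDIG as case-insensitive while keeping the numeric terminals %x5C and %x75 exact. It assumes the interpretation above is the intended one, and the helper names `is_hexdig` and `is_unicode_escape` are made up for illustration, not taken from any RFC or library.

```python
# Quoted ABNF literals ("A" ... "F") are matched case-insensitively,
# under the interpretation discussed above (an assumption, not a ruling).
HEXDIG_LETTERS = set("ABCDEF")


def is_hexdig(ch: str) -> bool:
    """Match HEXDIG = DIGIT / "A" / "B" / "C" / "D" / "E" / "F"."""
    if len(ch) != 1:
        return False
    if "0" <= ch <= "9":  # DIGIT = %x30-39
        return True
    return ch.upper() in HEXDIG_LETTERS


def is_unicode_escape(s: str) -> bool:
    """Match escape %x75 4HEXDIG, i.e. a backslash, 'u', four hex digits."""
    return (
        len(s) == 6
        and s[0] == "\\"  # %x5C, a numeric terminal: exact match
        and s[1] == "u"   # %x75, a numeric terminal: lowercase u only
        and all(is_hexdig(c) for c in s[2:])
    )


print(is_unicode_escape("\\u005c"))  # lowercase hex letter
print(is_unicode_escape("\\u005C"))  # uppercase hex letter
print(is_unicode_escape("\\U005C"))  # uppercase U is not %x75
```

Under this reading the first two calls print True and the third prints False, which agrees with the prose of RFC 4627: the hex letters may be either case, but the u must be lowercase.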

184

answered Oct 10 '22 23:10

Jon Skeet

Tags: json, language-lawyer, unicode, rfc