In the JSON spec, what does "Since the first two characters of a JSON text will always be ASCII characters" mean?

Tags:

json

RFC 4627 on JSON reads:

Encoding

JSON text SHALL be encoded in Unicode. The default encoding is UTF-8.

Since the first two characters of a JSON text will always be ASCII characters [RFC0020], it is possible to determine whether an octet stream is UTF-8, UTF-16 (BE or LE), or UTF-32 (BE or LE) by looking at the pattern of nulls in the first four octets.

What does "Since the first two characters of a JSON text will always be ASCII characters [RFC0020]" mean? I've looked at RFC0020 but couldn't find anything about it. JSON could start with {" or { " (i.e. whitespace before the quote).


asked Nov 20 '10 10:11

dan gibson

1 Answer

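It means that a JSON text always starts with ASCII characters. Under RFC 4627 the root must be an object or an array, so the first character is { or [, and the character after it is whitespace, a quote, or another structural or literal character, all of which are ASCII. Non-ASCII characters are only permitted inside strings, and a string cannot be the root value. This makes it possible to determine the encoding from the start of the stream or file.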

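UTF-16 and UTF-32 streams often carry a BOM at the start, which identifies the encoding directly. The RFC's method does not depend on one, though: an ASCII character encoded in UTF-16 or UTF-32 contains null octets at fixed positions, so knowing that the first two characters are ASCII lets you read the encoding off the pattern of nulls in the first four octets.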

I assume the spec mentions this specifically because the same trick does not work for most other text streams or files: an arbitrary text file can start with any two characters, so its first bytes are not known in advance.
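The null-pattern check can be sketched in Python roughly as follows. This is a minimal illustration, not a reference implementation: the function name `detect_json_encoding` is made up for this example, the input is assumed to have no BOM, and anything shorter than four octets is simply treated as UTF-8.

```python
def detect_json_encoding(data: bytes) -> str:
    """Guess the encoding of a JSON text from the nulls in its first four octets.

    Assumes the text has no BOM and starts with two ASCII characters,
    as RFC 4627 guarantees for a JSON object or array.
    """
    head = data[:4]
    if len(head) < 4:
        # Too short for the four-octet test; fall back to the default encoding.
        return "utf-8"

    # True where the octet is null, False where it is not.
    nulls = tuple(octet == 0 for octet in head)

    patterns = {
        (True, True, True, False): "utf-32-be",   # 00 00 00 xx
        (True, False, True, False): "utf-16-be",  # 00 xx 00 xx
        (False, True, True, True): "utf-32-le",   # xx 00 00 00
        (False, True, False, True): "utf-16-le",  # xx 00 xx 00
    }
    # Any other pattern (typically no nulls at all) is taken to be UTF-8.
    return patterns.get(nulls, "utf-8")
```

For example, `'{"a": 1}'.encode("utf-16-le")` begins with the octets 7B 00 22 00, which matches the `xx 00 xx 00` row, so the function returns `"utf-16-le"`; the same text encoded as UTF-8 begins with 7B 22 61 22, contains no nulls, and is reported as `"utf-8"`.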


answered Sep 24 '22 00:09

Oded
