Compact binary representation of JSON

Are there any compact binary representations of JSON out there? I know there is BSON, but even that webpage says "in many cases is not much more efficient than JSON. In some cases BSON uses even more space than JSON".

I'm looking for a format that's as compact as possible, preferably some kind of open standard?

851

asked Feb 03 '11 23:02

Shezan Baig

1 Answer

You could take a look at the Universal Binary JSON specification. It won't be as compact as Smile because it doesn't do name references, but it is 100% compatible with JSON, whereas BSON and BJSON define data structures that don't exist in JSON, so there is no standard conversion to and from JSON.

It is also (intentionally) criminally simple to read and write, with a standard layout of:

[type, 1-byte char]([length, 4-byte int32])([data])

So simple data types begin with a one-byte ASCII marker code, such as 'I' for a 32-bit int, 'T' for true, 'Z' for null, 'S' for string and so on.

The format is designed to be fast to read: every variable-length data structure is prefixed with its size, so there is no scanning for null-terminated sequences.

For example, consider a string laid out like this (the [] characters are just for illustration; they are not written in the format):

[S][512][this is a really long 512-byte UTF-8 string....]

You would see the 'S', switch to string processing, read the 4-byte integer that follows it ("512"), and know that you can grab the next 512 bytes in one chunk and decode them back into a string.
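As a rough sketch of that read path (not the reference implementation), here is how it could look in Python. It assumes exactly the layout described above: a one-byte 'S' marker, a 4-byte big-endian int32 length, then the UTF-8 bytes. The function name `read_string` is made up for illustration.

```python
import io
import struct


def read_string(stream):
    """Read one string laid out as [S][int32 length][UTF-8 bytes]."""
    marker = stream.read(1)
    if marker != b"S":
        raise ValueError(f"expected string marker b'S', got {marker!r}")
    # 4-byte big-endian length, as described in the answer.
    (length,) = struct.unpack(">i", stream.read(4))
    # The length is known up front, so the payload is read in one chunk.
    data = stream.read(length)
    if len(data) != length:
        raise ValueError("stream ended before the full string was read")
    return data.decode("utf-8")


# Build a sample value by hand, then read it back.
payload = "héllo".encode("utf-8")
encoded = b"S" + struct.pack(">i", len(payload)) + payload
decoded = read_string(io.BytesIO(encoded))
```

There is no scanning for a terminator anywhere: the reader never has to inspect the payload bytes to find out where the string ends.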

Similarly, numeric values are written without a length value to be more compact, because their types (byte, int32, int64, double) each define their own length in bytes (1, 4, 8 and 8 respectively). There is also support for arbitrarily long numbers that is extremely portable, even on platforms that don't support them.
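To illustrate the fixed-width idea, here is a small Python sketch in which the marker alone tells the reader how many bytes follow. Only the 'I' marker is named in this answer; the other marker letters in the table, and the helper names `write_number` and `read_number`, are assumptions for illustration and should be checked against the spec.

```python
import io
import struct

# marker -> (struct format, payload size in bytes).
# Only b"I" comes from the answer; b"B", b"L" and b"D" are assumed letters.
NUMERIC_TYPES = {
    b"B": (">b", 1),  # byte
    b"I": (">i", 4),  # int32
    b"L": (">q", 8),  # int64
    b"D": (">d", 8),  # double
}


def write_number(marker, value):
    """Encode a number as [marker][fixed-width big-endian payload]."""
    fmt, _ = NUMERIC_TYPES[marker]
    return marker + struct.pack(fmt, value)


def read_number(stream):
    """Read one number; the marker determines the payload width."""
    marker = stream.read(1)
    fmt, size = NUMERIC_TYPES[marker]
    (value,) = struct.unpack(fmt, stream.read(size))
    return value


# Two values back to back, with no length fields between them.
encoded = write_number(b"I", 1024) + write_number(b"D", 0.5)
stream = io.BytesIO(encoded)
first = read_number(stream)
second = read_number(stream)
```

An int32 costs 5 bytes this way (marker plus payload) and a double costs 9, regardless of the value being stored.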

On average you should see a size reduction of roughly 30% with a well-balanced JSON object (lots of mixed types). If you want to know exactly how certain structures compress or don't compress, you can check the Size Requirements section of the spec to get an idea.

On the bright side, regardless of compression, the data will be written in a more optimized format and be faster to work with.

I checked in the core Input/OutputStream implementations for reading and writing the format to GitHub today. I'll check in general reflection-based object mapping later this week.

You can just look at those two classes to see how to read and write the format; I think the core logic is something like 20 lines of code. The classes are longer because of abstractions around the methods and some structure for checking the marker bytes to make sure the data file is in a valid format, things like that.

If you have really specific questions, like the endianness of the spec (big-endian) or the numeric format for doubles (IEEE 754), all of that is covered in the spec doc, or just ask me.
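For a concrete feel of what big-endian and IEEE 754 mean on the wire, this short Python snippet shows the raw bytes for an int32 and a double. It uses only the standard `struct` module and says nothing about the marker bytes that would precede these payloads.

```python
import struct

# Big-endian int32: most significant byte first.
int_bytes = struct.pack(">i", 1)

# Big-endian IEEE 754 double: sign and exponent bits lead.
double_bytes = struct.pack(">d", 1.0)

# The same int32 in little-endian order, for contrast.
little_endian = struct.pack("<i", 1)
```

Here `int_bytes` is `00 00 00 01` and `double_bytes` is `3f f0 00 00 00 00 00 00`, while the little-endian form of the same int is `01 00 00 00`.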

Hope that helps!

146

answered Jan 02 '23 04:01

Riyad Kalla


Tags: json, format, binary
