Storing JSON in BigQuery

I have various highly nested json objects. I am wondering whether to store these as STRUCTs in BigQuery or as a STRING. If storing it as a string, then I can use JSON_EXTRACT where necessary to get what I need. I have a few questions on using the following approach:

  • Would it be a bad idea storing json data as a string instead of record?
  • Would there be a big performance hit whenever using that json field if it's stored as a string?
  • What additional advantages would storing the json as a STRUCT instead of a string give?

Finally, I wasn't able to find any place in the documentation that gives examples of how to query STRUCTs. The only place I could find was https://cloud.google.com/bigquery/docs/nested-repeated. Are there examples in the documentation (or elsewhere) on querying nested fields? Additionally, why is the term RECORD and STRUCT used interchangeably on this page?

Note that the json will not be repeated at the root level, i.e., it will look like {...} and not [{...},{...}].

As a reference, in Redshift you would (as of this question) store json as a string and use the json-functions to manipulate it: https://stackoverflow.com/a/32731374/651174.
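For illustration, here is roughly how the two storage choices differ at query time. Table and field names below (`mydataset.events`, `payload`, `user.name`) are hypothetical, not from the question:

```sql
-- JSON stored as a STRING: extract fields with JSON functions
SELECT
  JSON_EXTRACT_SCALAR(payload, '$.user.name') AS user_name
FROM `mydataset.events`;

-- Same data modeled as a STRUCT (RECORD) column: plain dot notation
SELECT
  payload.user.name AS user_name
FROM `mydataset.events_structured`;
```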

Asked Jun 04 '19 by David542


1 Answer

I usually do both:

  • Store JSON objects as STRINGs, for posterity and future refactoring.
  • Materialize easy-to-query tables from your JSON objects, to give you and your team a better querying experience.

My 3 steps:

  1. Store everything as JSON strings. Then you won't lose data in case of schema changes, for example.
  2. Create a VIEW that uses JSON_EXTRACT to expose the data as easy-to-query columns.
  3. Materialize those views into tables for the best performance and ease.
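The three steps above could be sketched roughly like this in BigQuery Standard SQL (table, view, and JSON path names are hypothetical, chosen only for illustration):

```sql
-- 1. Raw table: store everything as a JSON string
CREATE TABLE `mydataset.raw_events` (
  ingested_at TIMESTAMP,
  payload STRING  -- the full JSON object, untouched
);

-- 2. View that extracts the fields you care about into columns
CREATE VIEW `mydataset.events_view` AS
SELECT
  ingested_at,
  JSON_EXTRACT_SCALAR(payload, '$.user.id') AS user_id,
  JSON_EXTRACT_SCALAR(payload, '$.event.type') AS event_type
FROM `mydataset.raw_events`;

-- 3. Materialize the view into a table for query performance
CREATE TABLE `mydataset.events` AS
SELECT * FROM `mydataset.events_view`;
```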

Then, in case of schema change:

  1. Everything you have stored, stays the same.
  2. You can modify the views to suit the new schema.
  3. You can re-materialize tables into the new schema.
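Continuing the sketch above, handling a schema change then only touches the view and the materialized table, never the stored JSON (names remain hypothetical):

```sql
-- Adapt the view to the new schema; the raw JSON strings stay as-is
CREATE OR REPLACE VIEW `mydataset.events_view` AS
SELECT
  ingested_at,
  JSON_EXTRACT_SCALAR(payload, '$.user.id') AS user_id,
  JSON_EXTRACT_SCALAR(payload, '$.event.category') AS event_category  -- newly added field
FROM `mydataset.raw_events`;

-- Re-materialize into the new schema
CREATE OR REPLACE TABLE `mydataset.events` AS
SELECT * FROM `mydataset.events_view`;
```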
Answered Oct 17 '22 by Felipe Hoffa