BigQuery - remove unused column from schema

1 Answers

If your table does not consist of record/repeated type fields - your simple option is:

Select valid columns while filtering out bad records into new temp table

SELECT < list of original columns >
FROM YourTable
WHERE < filter to remove bad entries here >

Write above to temp table - YourTable_Temp
Make a backup copy of "broken" table - YourTable_Backup
Delete YourTable
Copy YourTable_Temp to YourTable
Check if all looks as expected and if so - get rid of temp and backup tables

Please note: the cost of above #1 is exactly the same as action in first bullet in your question. The rest of actions (copy) are free

In case if you have repeated/record fields - you still can execute above plan, but in #1 you will need to use some BigQuery User-Defined Functions to have proper schema in output
You can see below for examples - of course this will require some extra dev - but if you are in critical situation - this should work for you

Create a table with Record type column
create a table with a column type RECORD

I hope, at some point Google BigQuery Team will add better support for cases like yours when you need to manipulate and output repeated/record data, but for now this is a best workaround I found - at least for myself

133

answered Sep 19 '22 16:09

Mikhail Berlyant

Related questions
                            
                                Knowledge Graph API in BigQuery
                            
                                BigQuery replaced most of my Spark jobs, am I missing something?
                            
                                Is there an Autoincrement in BigQuery?
                            
                                How to download all data in a Google BigQuery dataset?
                            
                                java.net.UnknownHostException Unable to resolve host "accounts.google.com": No address associated with hostname while inserting rows in bigquery
                            
                                Do I need create tables manually in Google BigQuery to view the raw event data from Firebase?
                            
                                MongoDB to BigQuery
                            
                                Indexes on BigQuery Table
                            
                                How to flatten an array with UNNEST or any other functions?
                            
                                google-bigquery format date as mm/dd/yyyy in query results
                            
                                BigQuery - IFERROR for standardSQL
                            
                                How to select multiple custom Firebase event parameters in BigQuery?
                            
                                What's a good balance to decide when to partition a table in BigQuery?
                            
                                How to convert an array extracted from a json string field to a bigquery Repeated field?
                            
                                Update with join with BigQuery
                            
                                How is user_engagement event generated in firebase analytics?
                            
                                How can I make integration tests with google cloud bigquery
                            
                                How do I find elements in an array in BigQuery
                            
                                Left outer join in BigQuery on multiple keys doesn't if one of them is null
                            
                                BigQuery select * except nested column

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

BigQuery - remove unused column from schema

Tags:

google-bigquery

Lior

People also ask

1 Answers

Mikhail Berlyant

Recent Activity

Donate For Us