Is there a way to select table_id in a BigQuery table wildcard query?

Tags:

google-bigquery

I have a set of day-sharded data where individual entries do not contain the day. I would like to use table wildcards to select all available data and get back data that is grouped by both the column I am interested in and the day that it was captured. Something, in other words, like this:

SELECT table_id, identifier, Sum(AppAnalytic) as AppAnalyticCount 
    FROM  (TABLE_QUERY(database_main,'table_id CONTAINS "Title_" AND length(table_id) >= 4')) 
    GROUP BY identifier, table_id order by AppAnalyticCount DESC LIMIT 10

Of course, this does not actually work because table_id is not visible in the table aggregation resulting from the TABLE_QUERY function. Is there any way to accomplish this? Some sort of join on table metadata perhaps?


asked Apr 18 '14 16:04

OverclockedTim

1 Answer

This functionality is now available in BigQuery through the _TABLE_SUFFIX pseudo column. Full documentation is at https://cloud.google.com/bigquery/docs/querying-wildcard-tables. A couple of things to note:

You need to use Standard SQL, because wildcard tables are not available in legacy SQL.
You have to alias _TABLE_SUFFIX to another name in your SELECT list, as in the following example:

SELECT _TABLE_SUFFIX as table_id, ... FROM `MyDataset.MyTablePrefix_*`
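Applied to the query in the question, this might look roughly like the sketch below. It assumes the sharded tables all start with the prefix Title_ (the original filter used CONTAINS, but a wildcard can only appear at the end of the table name) and that identifier and AppAnalytic are columns in every shard; I have not run it against the asker's data.

```sql
#standardSQL
-- Sketch only: dataset, prefix, and column names are taken from the question
-- and assumed to exist as written.
SELECT
  _TABLE_SUFFIX AS table_id,            -- the part of the table name matched by *
  identifier,
  SUM(AppAnalytic) AS AppAnalyticCount
FROM `database_main.Title_*`
GROUP BY table_id, identifier
ORDER BY AppAnalyticCount DESC
LIMIT 10
```

Note that _TABLE_SUFFIX holds only the portion of the name matched by the wildcard (for day-sharded tables, typically the date), not the full table name. If the full name is needed, something like CONCAT('Title_', _TABLE_SUFFIX) should rebuild it.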


answered Sep 27 '22 20:09

Mosha Pasumansky
