I have a dataset in BigQuery. This dataset contains multiple tables. I am doing the following steps programmatically using the BigQuery API: <ol> <li>Querying the tables in the dataset - Since my response is too large, I am enabling allowLargeResults parameter and diverting my response to a destination table.</li> <li>I am then exporting the data from the destination table to a GCS bucket.</li> </ol> Requirements: <ul> <li>Suppose my process fails at Step 2, I would like to re-run this step.</li> <li>But before I re-run, I would like to check/verify that the specific destination table named 'xyz' already exists in the dataset. </li> <li>If it exists, I would like to re-run step 2.</li> <li>If it does not exist, I would like to do foo.</li> </ul> How can I do this? Thanks in advance.

Here is a python snippet that will tell whether a table exists (deleting it in the process--careful!): <pre class="prettyprint"><code>def doesTableExist(project_id, dataset_id, table_id): bq.tables().delete( projectId=project_id, datasetId=dataset_id, tableId=table_id).execute() return False </code></pre> Alternately, if you'd prefer not deleting the table in the process, you could try: <pre class="prettyprint"><code>def doesTableExist(project_id, dataset_id, table_id): try: bq.tables().get( projectId=project_id, datasetId=dataset_id, tableId=table_id).execute() return True except HttpError, err if err.resp.status <> 404: raise return False </code></pre> If you want to know where <code>bq</code> came from, you can call <code>build_bq_client</code> from here: http://code.google.com/p/bigquery-e2e/source/browse/samples/ch12/auth.py In general, if you're using this to test whether you should run a job that will modify the table, it can be a good idea to just do the job anyway, and use <code>WRITE_TRUNCATE</code> as a write disposition. Another approach can be to create a predictable job id, and retry the job with that id. If the job already exists, the job already ran (you might want to double check to make sure the job didn't fail, however).

BigQuery - Check if table already exists

2 Answers

Alex F's solution works on v0.27, but will not work on later versions. In order to migrate to v0.28+, the below solution will work.

from google.cloud import bigquery

project_nm = 'gc_project_nm'
dataset_nm = 'ds_nm'
table_nm = 'tbl_nm'

client = bigquery.Client(project_nm)
dataset = client.dataset(dataset_nm)
table_ref = dataset.table(table_nm)

def if_tbl_exists(client, table_ref):
    from google.cloud.exceptions import NotFound
    try:
        client.get_table(table_ref)
        return True
    except NotFound:
        return False

if_tbl_exists(client, table_ref)

121

answered Oct 17 '22 04:10

tarheel

Here is a python snippet that will tell whether a table exists (deleting it in the process--careful!):

def doesTableExist(project_id, dataset_id, table_id):
  bq.tables().delete(
      projectId=project_id, 
      datasetId=dataset_id,
      tableId=table_id).execute()
  return False

Alternately, if you'd prefer not deleting the table in the process, you could try:

def doesTableExist(project_id, dataset_id, table_id):
  try:
    bq.tables().get(
        projectId=project_id, 
        datasetId=dataset_id,
        tableId=table_id).execute()
    return True
  except HttpError, err
    if err.resp.status <> 404:
       raise
    return False

If you want to know where bq came from, you can call build_bq_client from here: http://code.google.com/p/bigquery-e2e/source/browse/samples/ch12/auth.py

In general, if you're using this to test whether you should run a job that will modify the table, it can be a good idea to just do the job anyway, and use WRITE_TRUNCATE as a write disposition.

Another approach can be to create a predictable job id, and retry the job with that id. If the job already exists, the job already ran (you might want to double check to make sure the job didn't fail, however).

answered Oct 17 '22 04:10

Jordan Tigani

Related questions
                            
                                Google Calendar API v3 Access Not Configured
                            
                                Share a Drive document without notifying user with Google Apps Script
                            
                                Google API Client "refresh token must be passed in or set as part of setAccessToken"
                            
                                How to get Google +1 count for current page in PHP?
                            
                                Google URL Shortener 403 Rate Limit Exceeded
                            
                                Not able to display embedded images in HTML using Gmail API + rails
                            
                                Google Sign-In API Hang with uncaught error Failed to get parent origin from URL hash
                            
                                What is the rate limit for direct use of the Google Analytics Measurement Protocol API?
                            
                                Using Google OAuth on localhost
                            
                                What is the usage of the client_secrets.json file?
                            
                                Missing Google APIs for API Level 25
                            
                                Is it possible to play with Google Mirror API without having the device?
                            
                                How do I access the Google Spreadsheets API in PHP?
                            
                                Google Translate API always returning 'Daily Limit Exceeded'
                            
                                Gmail API: 400 bad request when trying to send an email (PHP code)
                            
                                Which Google api to use for getting user's first name, last name, picture, etc?
                            
                                Google API in Javascript
                            
                                Error: "message": "Login Required" when use Youtube Analytics API
                            
                                How to get a picture of a place from google maps or places API
                            
                                How to check if user is logged in or not with "Google Sign In" (OAuth 2.0)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

BigQuery - Check if table already exists

Tags:

export

google-api

google-cloud-storage

google-bigquery

activelearner

People also ask

2 Answers

tarheel

Jordan Tigani

Recent Activity

Donate For Us