Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to Sync Mysql into Bigquery in realtime?

Currently I have some script which first deletes the table and upload the table from MySQL to Bigquery. And many time it had failed. Plus it run only once a day. I am looking for some scalable and realtime solution. Your Help will be much appreciated :)

like image 506
Anurag Jain Avatar asked Nov 16 '16 10:11

Anurag Jain


1 Answers

Read these series of posts from Wepay, where they detail how they sync their MySQL databases to BigQuery, using Airflow:

  • https://wecode.wepay.com/posts/wepays-data-warehouse-bigquery-airflow
  • https://wecode.wepay.com/posts/airflow-wepay
  • (3rd one is about BigQuery)

As a summary (quoting):

  • Setup authentication, connections, DAG.
  • Define which columns to pull from MySQL and load into BigQuery.
  • Choose how to load the data: incrementally, or fully.
  • De-duplicating.
like image 186
Felipe Hoffa Avatar answered Oct 07 '22 03:10

Felipe Hoffa