BigQuery logger handler for Airflow

Installation

pip install airflow-bigquerylogger

Configuration

AIRFLOW__CORE__REMOTE_LOGGING='true'
AIRFLOW__CORE__REMOTE_BASE_LOG_FOLDER='gs://bucket/path'
AIRFLOW__CORE__REMOTE_LOG_CONN_ID='gcs_log'
AIRFLOW__CORE__LOGGING_CONFIG_CLASS='bigquerylogger.config.LOGGING_CLASS'
AIRFLOW__CORE__LOG_BIGQUERY_DATASET='dataset.table'
AIRFLOW__CORE__LOG_BIGQUERY_LIMIT=50
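
A misnamed or missing variable is easy to overlook, so a small pre-flight check can help before starting Airflow. The following is a minimal sketch, not part of this package: the helper `check_bigquery_logger_env` is hypothetical, and it assumes that the first five variables are all required, that the limit is a whole number, and that `LOG_BIGQUERY_DATASET` always has the `dataset.table` form shown above.

```python
import os

# Variable names copied from the configuration listing above.
# Treating all of them as required is an assumption of this sketch.
REQUIRED_VARS = (
    "AIRFLOW__CORE__REMOTE_LOGGING",
    "AIRFLOW__CORE__REMOTE_BASE_LOG_FOLDER",
    "AIRFLOW__CORE__REMOTE_LOG_CONN_ID",
    "AIRFLOW__CORE__LOGGING_CONFIG_CLASS",
    "AIRFLOW__CORE__LOG_BIGQUERY_DATASET",
)


def check_bigquery_logger_env(env=None):
    """Return a list of problems found; an empty list means nothing was flagged."""
    env = os.environ if env is None else env
    problems = [f"{name} is not set" for name in REQUIRED_VARS if not env.get(name)]

    # Assumed format: exactly two non-empty parts, as in 'dataset.table'.
    target = env.get("AIRFLOW__CORE__LOG_BIGQUERY_DATASET", "")
    parts = target.split(".")
    if target and (len(parts) != 2 or not all(parts)):
        problems.append(
            "AIRFLOW__CORE__LOG_BIGQUERY_DATASET should look like dataset.table"
        )

    # Assumed: the limit, when present, is a non-negative integer.
    limit = env.get("AIRFLOW__CORE__LOG_BIGQUERY_LIMIT", "")
    if limit and not limit.isdigit():
        problems.append("AIRFLOW__CORE__LOG_BIGQUERY_LIMIT should be an integer")

    return problems


if __name__ == "__main__":
    for problem in check_bigquery_logger_env():
        print(problem)
```

Run as a script, it prints one line per problem and prints nothing when every check passes.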

Google Cloud BigQuery

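Rows that were recently written to a table via streaming (using the tabledata.insertAll method) cannot be modified with UPDATE, DELETE, or MERGE statements. Because old log rows cannot simply be deleted on demand, it is a good idea to configure an expiration (retention period) on the table so that they are removed automatically.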
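
One way to get automatic retention is partition expiration, which BigQuery can set through a DDL statement. The sketch below only builds that statement as a string; running it (in the BigQuery console or a client of your choice) is left to you. It assumes the log table is time-partitioned, which this README does not state, and the helper name `partition_expiration_ddl` is hypothetical.

```python
def partition_expiration_ddl(table: str, days: int) -> str:
    """Build a BigQuery DDL statement that expires partitions after `days` days.

    `table` is expected in the 'dataset.table' form used in the configuration.
    Assumes the table is time-partitioned; on a non-partitioned table
    BigQuery will reject the statement.
    """
    if days <= 0:
        raise ValueError("days must be a positive integer")
    if "`" in table:
        # Refuse names that would break out of the backtick quoting.
        raise ValueError("table name must not contain backticks")
    return (
        f"ALTER TABLE `{table}` "
        f"SET OPTIONS (partition_expiration_days = {days})"
    )


if __name__ == "__main__":
    # 30 days is an arbitrary example value, not a recommendation.
    print(partition_expiration_ddl("dataset.table", 30))
```

For a non-partitioned table, the closest equivalent is a table-level expiration, but that removes the whole table rather than just the old rows, so partitioning is usually the better fit for logs.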

Credits

Thanks to the Bluecore engineering team for this useful article.